Abstract

Exponential families are a particular class of statistical manifolds which are especially important in statistical inference, and which appear very frequently in statistics. For example, the set of normal distributions, with mean $\mu$ and deviation $\sigma$, forms a 2-dimensional exponential family.

In this paper, we show that the tangent bundle of an exponential family is naturally a Kähler manifold. This simple but crucial observation leads to the formalism of quantum mechanics in its geometrical form, i.e. based on the Kähler structure of the complex projective space, but generalizes also to more general Kähler manifolds, providing a natural geometric framework for the description of quantum systems.

Many questions related to this "statistical Kähler geometry" are discussed, and a close connection with representation theory is observed.

Examples of physical relevance are treated in detail. For example, it is shown that the spin of a particle can be entirely understood by means of the usual binomial distribution.

This paper centers on the mathematical foundations of quantum mechanics, and on the question of its potential generalization through its geometrical formulation.

1 Introduction - summary

In the 70's, it was observed by Chernoff and Marsden [CM74] that the Schrödinger equation $i\hbar \frac{d\psi}{dt} = H\psi$ is Hamiltonian with respect to the symplectic form coming from the imaginary part of the Hermitian scalar product of the Hilbert space $\mathcal{H}$ of possible quantum states. Since then, this Hamiltonian view on quantum mechanics has been developed independently by several authors [CL84, CMP90, Hes84, Hes85, Kib79] and has led to a complete geometrization of the quantum formalism, entirely based on the Kähler properties of the complex projective space $\mathbb{P}(\mathcal{H})$. This reformulation, which is very elegant and complete, is now usually referred to as the geometrical formulation$^{1}$ of quantum mechanics [AS99, CMP90].

The geometrical formulation was mainly motivated by the desire to generalize quantum mechanics, especially in view of quantum gravity. The basic idea is that, by geometrizing the quantum formalism, one frees it from its burdensome linearity and puts it on a geometrical ground akin to Einstein's theory of gravitation. Geometry is, in this regard, particularly "flexible", and seems an appropriate setting for generalizations. For example, while it is not clear how to generalize Hilbert spaces, generalizing the complex projective space is straightforward: instead of $\mathbb{P}(\mathcal{H})$, take an arbitrary Kähler manifold. Such possibilities have been discussed in [Gib92, Hug95, Kaw, Kib79] and applications towards quantum gravity have been proposed in [MT03, MT04].
1 Not to be confused with the geometric quantization of Kostant and Souriau [Kos70, Sou97]. In the geometrical formulation, the Hilbert space is considered as given, not as the result of a quantization scheme.



These proposals, however, are severely restricted in scope by the need to allow for a probabilistic interpretation. The latter, contrary to the quantum state space and dynamics, is extremely difficult to generalize in a purely geometrical context and is usually not even discussed. The reason is that in the geometrical formulation, all formulas related to probabilities rely on an expression of the form $\cos^2(d(\cdot,\cdot))$, where $d(\cdot,\cdot)$ is the geodesic distance on $\mathbb{P}(\mathcal{H})$, an expression which is clearly specific to $\mathbb{P}(\mathcal{H})$ and which, consequently, cannot be generalized directly to arbitrary Kähler manifolds.

These difficulties, related to the probabilistic interpretation, raise the following question: what is the link between the Kähler structure of $\mathbb{P}(\mathbb{C}^n)$ and probabilities? This question, which is central in the present work, was already formulated in our previous paper [Molb], where an interesting, though puzzling, connection with information geometry was observed.

Let us recall, in this regard, that information geometry is a branch of statistics characterized by its use of differential geometrical techniques [AN00, MR93]. Its basic objects of study are statistical manifolds, i.e. manifolds whose points can be identified with probability density functions over some fixed measured space. For example, Gaussian distributions over $\mathbb{R}$ form a 2-dimensional statistical manifold parameterized by the mean $\mu$ and deviation $\sigma$. In general (and this is what information geometry is about) a statistical manifold $S$ possesses a rich geometry that encodes many of its statistical properties; it has a Riemannian metric $h_F$, called the Fisher metric, and a pair of dual affine connections $\nabla^{(e)}$, $\nabla^{(m)}$, respectively called the exponential connection and the mixture connection, which can be used, for example, to give lower bounds in estimation problems (compare e.g. the Cramér-Rao inequality). Together, the triplet $(h_F, \nabla^{(e)}, \nabla^{(m)})$ forms what is called a dualistic structure, and it is probably the most important structure in information geometry.$^{2}$

Very little attention has been paid to dualistic structures outside the statistical community, but we can mention, in connection with [Molb], the work of Dombrowski. In a paper which already goes back to the 60's [Dom62], Dombrowski shows that if a manifold $M$ is endowed with a dualistic structure ($M$ need not be a statistical manifold here), then its tangent bundle $TM$ becomes naturally, via a simple geometric construction, an almost Hermitian manifold.$^{3}$ A direct consequence of Dombrowski's construction, which seems to have gone unnoticed in the existing literature, is that the tangent bundle of a statistical manifold is canonically an almost Hermitian manifold.

This observation, although mathematically very simple, is one of the most important of [Molb]. It tells us that statistics abounds with almost Hermitian manifolds. To illustrate this, let us consider what is probably the simplest example that one may think of. Take a finite set $\Omega := \{x_1, \dots, x_n\}$ and consider the space $\mathcal{P}_n^{\times}$ of nowhere vanishing$^{4}$ probabilities $p : \Omega \to \mathbb{R}$, $p > 0$, $\sum_{k=1}^{n} p(x_k) = 1$. This is an $(n-1)$-dimensional statistical manifold, therefore its tangent bundle $T\mathcal{P}_n^{\times}$ is an almost Hermitian manifold. Now the main observation in [Molb] may be formulated as follows: the canonical almost Hermitian structure of $T\mathcal{P}_n^{\times}$ is locally isomorphic to the Kähler structure of $\mathbb{P}(\mathbb{C}^n)$.

This result is intriguing. On one hand, it establishes a link between the geometrical formulation of quantum mechanics and information geometry, and suggests a possible information-theoretical origin of the quantum formalism. But on the other hand, the statistical relevance of Dombrowski's construction is not at all clear, and since $\mathcal{P}_n^{\times}$ is the only example in [Molb] for which explicit computations are performed, there is a priori no reason for other statistical manifolds to yield interesting geometrical results of physical importance.

As such, the results in [Molb] are potentially fruitful, but they raise many questions that need further investigation.

In the present paper, we develop some of the ideas of [Molb] and present mathematical results - at



2 See also [NS92].

3 In [Dom62], Dombrowski does not explicitly use the language of dualistic structures, and his main concern is the analytical properties of the almost complex structure that he constructs on the tangent bundle $TM$ of a Riemannian manifold $(M, g)$ endowed with a connection $\nabla$.

4 The condition $p > 0$ (instead of $p \geq 0$) is purely technical and ensures that $\mathcal{P}_n^{\times}$ has neither boundary nor corners.



the crossroad of information geometry, Kähler geometry, functional analysis and, to some extent, representation theory - which reinforce the idea that the quantum formalism has a statistical and information-theoretical origin. In particular, we want to describe a mechanism by which Kähler geometry emerges from information geometry, and to explain, by revisiting the geometrical formulation, how the latter, in its various aspects, can be understood within this larger information-theoretical setting.

A central role, in this development, is played by the so-called exponential families. Exponential families are a particular class of probability distributions which plays a key role in various ramifications of statistics, and especially in the context of statistical inference (see for example [AN00]). Their importance stems from the fact that among all possible parameterized statistical models, they are the only ones having efficient estimators, meaning roughly that it is possible, given sample data, to estimate the unknown parameters of the model in the "best possible way". Examples of exponential families are found among the most common probability distributions: Bernoulli, beta, binomial, chi-square, Dirichlet, exponential, gamma, geometric, multinomial, normal, Poisson, to name but a few.

For us, the important property of an exponential family $\mathcal{E}$ is that its canonical almost Hermitian structure (on $T\mathcal{E}$) is always a Kähler structure (Corollary 4.4), allowing us to define, via some refinements of Dombrowski's arguments, what we shall call the Kählerification of an exponential family, denoted $\mathcal{E}^{\mathbb{C}}$ (see Definition 6.1). By construction, a Kählerification $\mathcal{E}^{\mathbb{C}}$ is a Kähler manifold, and it always comes with a Riemannian submersion $\pi_{\mathcal{E}} : \mathcal{E}^{\mathbb{C}} \to \mathcal{E}$. As an important example, the Kählerification of $\mathcal{P}_n^{\times}$ yields $\mathbb{P}(\mathbb{C}^n)^{\times}$, i.e. $(\mathcal{P}_n^{\times})^{\mathbb{C}} = \mathbb{P}(\mathbb{C}^n)^{\times}$, where $\mathbb{P}(\mathbb{C}^n)^{\times} := \{[z_1, \dots, z_n] \in \mathbb{P}(\mathbb{C}^n) \mid z_k \neq 0 \text{ for all } k = 1, \dots, n\}$ (we use homogeneous coordinates, see §7 and Proposition 7.5). Other classical Kähler manifolds can be realized as the Kählerification of appropriate exponential families, like $\mathbb{C}^n$ and the Poincaré upper half plane $\mathbb{H}$.

Kählerifications (and their completions) generalize the usual quantum state space $\mathbb{P}(\mathbb{C}^n)$. But what about the observables? In the geometrical formulation, observables are Kähler functions, i.e. functions $f : \mathbb{P}(\mathbb{C}^n) \to \mathbb{R}$ whose associated Hamiltonian vector fields $X_f$ are Killing vector fields.

In §5 we investigate the properties of Kähler functions in the context of Kählerification and obtain a relation between the statistical structure of $\mathcal{E}$ and a class of Kähler functions on $\mathcal{E}^{\mathbb{C}}$, as follows. If $(\Omega, dx)$ denotes the measured space on which $\mathcal{E}$ is defined and if $X : \Omega \to \mathbb{R}$ belongs to a certain class of random variables which depends on the exponential structure of $\mathcal{E}$ (see (36)), then for any holomorphic isometry $\Phi : \mathcal{E}^{\mathbb{C}} \to \mathcal{E}^{\mathbb{C}}$, the function

$$ \mathcal{E}^{\mathbb{C}} \to \mathbb{R}, \qquad z \mapsto \int_{\Omega} X(x)\,\big[(\pi_{\mathcal{E}} \circ \Phi)(z)\big](x)\,dx \tag{1} $$

is a Kähler function (stated below as a corollary).

This result is a "geometric analogue" of the usual spectral decomposition theorem for Hermitian matrices. Indeed, when $\mathcal{E} = \mathcal{P}_n^{\times}$, the space of Kähler functions $\mathscr{K}(\mathbb{P}(\mathbb{C}^n))$ on $\mathbb{P}(\mathbb{C}^n)$ (the latter viewed as the natural "completion" of $(\mathcal{P}_n^{\times})^{\mathbb{C}}$) is isomorphic, in the Lie algebra sense, to the space of $n \times n$ skew Hermitian matrices $\mathfrak{u}(n)$, i.e., $\mathscr{K}(\mathbb{P}(\mathbb{C}^n)) \cong \mathfrak{u}(n)$, and the decomposition in (1) is in this case a rephrasing of the diagonalizability of a Hermitian matrix (see Lemma 7.6).

Later, while revisiting the geometrical formulation of quantum mechanics, we use this correspondence with spectral theory to propose a definition for the spectrum of a Kähler function $f : \mathcal{E}^{\mathbb{C}} \to \mathbb{R}$ of the form given in (1). Our definition reads as follows: $\mathrm{spec}(f) := \mathrm{Im}(X)$, where $\mathrm{Im}(X)$ denotes the image of the random variable $X : \Omega \to \mathbb{R}$. As the decomposition in (1) is usually not unique, our definition is only consistent when invariance properties are met. We also define, for a Kähler function $f$ as in (1) and a point $z \in \mathcal{E}^{\mathbb{C}}$, what might be interpreted, in physical jargon, as the probability that the observable $f$ yields, upon measurement, the eigenvalue $\lambda$ while the system is in the state $z$:

$$ P_{f,z}(\lambda) := \int_{X^{-1}(\lambda)} \big[(\pi_{\mathcal{E}} \circ \Phi)(z)\big](x)\,dx . $$

When $\mathcal{E} = \mathcal{P}_n^{\times}$, then $\mathrm{spec}(f)$ and $P_{f,z}$ are well defined for all Kähler functions $f$ on $\mathbb{P}(\mathbb{C}^n)$ (we extend our definitions via density arguments) and together, they yield the usual probabilistic interpretation of the geometrical formulation. In particular, $P_{f,z}$ depends on the expression $\cos^2(d(\cdot,\cdot))$, where $d(\cdot,\cdot)$ is the geodesic distance on $\mathbb{P}(\mathbb{C}^n)$.



When $\mathcal{E} = \mathcal{B}(n,q)$ is the space of binomial distributions with parameter $q \in\, ]0,1[$ defined over $\Omega := \{0, \dots, n\}$, then we have the following results (established in a later section). The Kählerification of $\mathcal{B}(n,q)$ is, up to completion, the 2-dimensional sphere of radius $n$ (regarded as a submanifold of $\mathbb{R}^3$). The space of Kähler functions on the sphere is generated by the functions $1, x, y, z$ and is isomorphic, in the Lie algebra sense, to $\mathfrak{u}(2)$. For the spectral theory, if $(u, v, w) \in \mathbb{R}^3$ is a vector whose Euclidean norm is $n/2$, then the spectrum (in our sense) of the function $f(x, y, z) = ux + vy + wz$ is exactly $\{-j, -j+1, \dots, j-1, j\}$, where $j := n/2$, and the probability $P_{f,(x,y,z)}$ is given, for $k \in \{0, \dots, n\}$, by

$$ P_{f,(x,y,z)}(-j+k) = \binom{n}{k}\big(\cos^2(\theta/2)\big)^{k}\big(\sin^2(\theta/2)\big)^{n-k}, \tag{2} $$

where $\binom{n}{k} = \frac{n!}{(n-k)!\,k!}$, and where $\theta$ is an angle whose cosine is determined by $(u, v, w)$ and the point $(x, y, z)$.

We recognize (and this is one of the main observations of this paper) a formula which describes the spin of a particle passing through two consecutive Stern-Gerlach devices. Recall that a Stern-Gerlach device produces a magnetic field oriented in a chosen direction, and that if a beam of particles (for example silver atoms) is sent through it, then, due to spin effects, it will split into a finite number of deflected parts. Equation (2), in this respect, gives the probability that a particle entering the second Stern-Gerlach device with maximum spin$^{5}$ is deflected into the $(-j+k)$-th outgoing beam, $\theta$ being the angle between the two magnetic fields produced by the Stern-Gerlach devices.
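As an illustration of formula (2), the following minimal sketch evaluates its right-hand side numerically. The function names and the sample values of $n$ and $\theta$ are chosen here for illustration only and do not come from the paper; the code simply tabulates a binomial distribution with success probability $\cos^2(\theta/2)$ against the values $-j, \dots, j$.

```python
from math import comb, cos, sin, pi

def spin_spectrum(n):
    """Values -j, -j+1, ..., j with j = n/2, as listed in the text."""
    j = n / 2
    return [-j + k for k in range(n + 1)]

def spin_probabilities(n, theta):
    """Right-hand side of formula (2) for k = 0, ..., n."""
    c2 = cos(theta / 2) ** 2
    s2 = sin(theta / 2) ** 2
    return [comb(n, k) * c2 ** k * s2 ** (n - k) for k in range(n + 1)]

# Illustrative values (an assumption, not taken from the paper):
# n = 2, i.e. j = 1, and an angle of pi/3 between the two fields.
n, theta = 2, pi / 3
for value, prob in zip(spin_spectrum(n), spin_probabilities(n, theta)):
    print(f"value {value:+.1f}: probability {prob:.4f}")
```

For $n = 1$ the list reduces to $[\sin^2(\theta/2), \cos^2(\theta/2)]$, the familiar two-outcome case, and for every $n$ the probabilities sum to one by the binomial theorem.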

What is remarkable with this result is that it only depends on the statistical structure of the binomial distribution $\mathcal{B}(n,q)$, providing support to the idea that the quantum formalism owes part of its mathematical structure to statistical concepts. Also, it shows that the probabilistic interpretation of the geometrical formulation, through $\mathrm{spec}(f)$ and $P_{f,z}$, can be extended to more general situations than the one originally considered with the complex projective space, situations which are physically relevant.

We have to emphasize, however, that not all the possibilities of the Stern-Gerlach experiment are exhausted with (2), which may be interpreted, at first, as a limitation of the statistical approach. But actually it is not. The remaining probabilities, as it turns out, can be obtained fairly easily by means of the "universal" inclusion$^{6}$ $\mathcal{B}(n,q) \subseteq \mathcal{P}_{n+1}^{\times}$, as follows. By "Kählerifying" this inclusion, one gets an embedding $S^2 \hookrightarrow \mathbb{P}(\mathbb{C}^{n+1})$ which makes it possible to extend every Kähler function $f$ on $S^2$ to a unique Kähler function $\hat{f}$ on $\mathbb{P}(\mathbb{C}^{n+1})$, the latter function having the advantage of carrying more information than the original one. In fact, we show that the map $f \mapsto \hat{f}$ is a homomorphism of Lie algebras which is, via the appropriate identifications, an irreducible unitary representation of $\mathfrak{u}(2)$ (see Proposition 9.7 and Lemmas 9.8 and 9.9). This allows us to extract the remaining probabilities (recall that the spin is usually described by the unitary representations of $\mathfrak{su}(2)$). Mathematically, this brings an interesting link between a purely geometrical problem (extending Kähler functions) and representation theory.

Collecting our results, we conclude that the spin of a particle can be entirely understood by means of 
the binomial distribution B(n, q) . 

We also briefly consider the space $\mathcal{N}(\mu, 1)$ of Gaussian distributions of mean $\mu$ and fixed deviation $\sigma = 1$ over $\Omega = \mathbb{R}$, give its Kählerification and describe its associated "spectral theory". As we observe, this exponential family is closely related to the quantum harmonic oscillator, a fact which can only be fully understood by the introduction of an infinite dimensional analogue of $\mathcal{P}_n^{\times}$. On this, however, we say very little due to space limitations and refer the reader to [Mola].

To summarize, we carried out, following ideas of [Molb], an analysis of the mathematical foundations of quantum mechanics, using a geometric and information-theoretical approach, which points towards the


5 By "maximum spin" we mean that the eigenvalue of the usual spin operator of the particle along the direction of the magnetic field of the first Stern-Gerlach device is, among the possible values $-j, -j+1, \dots, j-1, j$, exactly $j$. Here $j \in \{0, 1/2, 1, 3/2, \dots\}$ is the spin of the particle.

6 By "universal", we simply mean that for any statistical manifold $S$ defined over a finite set $\{x_1, \dots, x_n\}$, there is a canonical inclusion $S \subseteq \mathcal{P}_n^{\times}$. The space $\mathcal{P}_n^{\times}$ thus appears as a "universal container".



following conclusion: the quantum formalism is grounded on the Kähler geometry which naturally emerges from statistics. Examples like the spin support this claim, and the various mechanisms involved have been described; we defined the Kählerification of an exponential family and sketched the very basis of what may be considered as a "statistical spectral" theory for Kähler functions. In doing so, we observed an intriguing link between the problem of extending Kähler functions and representation theory which seems to connect our approach to the standard way physicists work.

The author is fully aware that the techniques and definitions introduced in this paper are still in an infant stage, and that they should probably be modified in the light of further progress. Nevertheless, it is likely that the relationship between statistics and Kähler geometry will grow in importance, and we hope that it may help to get a better understanding of the mathematical foundations of quantum mechanics. Deepening this comprehension might well lead to a viable generalization of quantum mechanics, or at least to a new comprehension of some of its conceptually puzzling aspects, especially those related to the measurement problem.

2 Information geometry 

In this section, we review the basic concepts of information geometry needed throughout this paper. Our (very short) presentation follows the current reference book [AN00], whose emphasis is on the Fisher metric and $\alpha$-connections of a given statistical model (see also [MR93]).

A statistical manifold (or statistical model) is a couple $(S, j)$ where $S$ is a manifold and where $j$ is an injective map from $S$ to the space of all probability density functions $p$ defined on a fixed measured space $(\Omega, dx)$:$^{7}$

$$ j : S \hookrightarrow \Big\{ p : \Omega \to \mathbb{R} \;\Big|\; p \text{ is measurable},\ p \geq 0 \text{ and } \int_{\Omega} p(x)\,dx = 1 \Big\}. \tag{3} $$

In the case of a discrete space $\Omega$, it will be implicitly assumed that $dx$ is the counting measure, i.e. $dx(A) = \mathrm{card}(A)$, where $\mathrm{card}(A)$ denotes the cardinality of a given subset $A \subseteq \Omega$. In this situation, integration of a function $X : \Omega \to \mathbb{R}$ with respect to the probability $p\,dx$ ($p$ being a probability density function) is simply given by:

$$ \int_{\Omega} X(x)\,p(x)\,dx = \sum_{x \in \Omega} X(x)\,p(x). \tag{4} $$

As a matter of notation, if $(\xi : U \subseteq S \to \mathbb{R}^n)$ is a chart of a statistical manifold $S$ with local coordinates $\xi = (\xi_1, \dots, \xi_n)$, then we shall indistinctly write $p(x;\xi)$ or $p_\xi(x)$ for the probability density function determined by $\xi$ and in the variable $x \in \Omega$.

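Now, given a "reasonable" statistical manifold $S$, it is possible to define a metric $h_F$ and a family of connections $\nabla^{(\alpha)}$ on $S$ ($\alpha \in \mathbb{R}$) in the following way: for a chart $\xi = (\xi_1, \dots, \xi_n)$ of $S$, define

• $(h_F)_\xi(\partial_i, \partial_j) := E_{p_\xi}\big(\partial_i \ln(p_\xi) \cdot \partial_j \ln(p_\xi)\big)$,

• $\Gamma^{(\alpha)}_{ij,k}(\xi) := E_{p_\xi}\Big[\Big(\partial_i \partial_j \ln(p_\xi) + \frac{1-\alpha}{2}\,\partial_i \ln(p_\xi) \cdot \partial_j \ln(p_\xi)\Big)\,\partial_k \ln(p_\xi)\Big]$,

where $E_{p_\xi}$ denotes the mean, or expectation, with respect to the probability $p_\xi\,dx$, and where $\partial_i$ is a shorthand for $\partial/\partial \xi_i$.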
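To make the two defining formulas concrete, here is a small sketch that evaluates them for a one-parameter family on $\Omega = \{0, 1\}$ with the counting measure. The choice of the Bernoulli family $p(1;\xi) = \xi$, $p(0;\xi) = 1 - \xi$ and all function names are assumptions made for this illustration; the derivatives of $\ln p$ are written out by hand.

```python
def bernoulli_scores(xi):
    """Probabilities and the first two xi-derivatives of ln p(x; xi), x in {0, 1}.

    Illustrative family (an assumption): p(1; xi) = xi, p(0; xi) = 1 - xi.
    """
    probs = {0: 1.0 - xi, 1: xi}
    d1 = {0: -1.0 / (1.0 - xi), 1: 1.0 / xi}
    d2 = {0: -1.0 / (1.0 - xi) ** 2, 1: -1.0 / xi ** 2}
    return probs, d1, d2

def fisher_metric(xi):
    """E[(d ln p)^2], the single component of the Fisher metric."""
    probs, d1, _ = bernoulli_scores(xi)
    return sum(probs[x] * d1[x] ** 2 for x in probs)

def alpha_christoffel(xi, alpha):
    """E[(d^2 ln p + (1 - alpha)/2 (d ln p)^2) d ln p], the single Gamma^(alpha)."""
    probs, d1, d2 = bernoulli_scores(xi)
    return sum(
        probs[x] * (d2[x] + 0.5 * (1.0 - alpha) * d1[x] ** 2) * d1[x]
        for x in probs
    )

print(fisher_metric(0.25))            # equals 1 / (xi * (1 - xi)) at xi = 0.25
print(alpha_christoffel(0.25, -1.0))  # vanishes: xi is affine for alpha = -1
```

In this example the coefficient for $\alpha = -1$ vanishes identically, which reflects the fact that $\xi = p(1)$ is an expectation-type coordinate for this family.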



7 Depending on the symbol we use for the variable living in $\Omega$, for example "x", "k", etc., we shall use the notation "dx", "dk", etc., for the measure on $\Omega$.



It can be shown that if the above expressions are defined and smooth for every chart of $S$ (this is not always the case), then $h_F$ is a well defined metric on $S$ called the Fisher metric, and that the $\Gamma^{(\alpha)}_{ij,k}$'s are the Christoffel symbols of a connection $\nabla^{(\alpha)}$ called the $\alpha$-connection. Among the $\alpha$-connections, the $(\pm 1)$-connections are particularly important; the $1$-connection is usually referred to as the exponential connection, also denoted $\nabla^{(e)}$, while the $(-1)$-connection is referred to as the mixture connection, denoted $\nabla^{(m)}$.

In this paper, we will only consider statistical manifolds $S$ for which the Fisher metric and $\alpha$-connections are well defined.

One particularity of the $(\pm\alpha)$-connections is that they are dual to each other with respect to the Fisher metric $h_F$, or equivalently, that they form a dualistic structure on $S$. The general definition of a dualistic structure on an arbitrary manifold $M$ is as follows: a dualistic structure on $M$ is a triple $(h, \nabla, \nabla^*)$ where $h$ is a Riemannian metric on $M$ and where $\nabla$ and $\nabla^*$ are connections satisfying

$$ X(h(Y,Z)) = h(\nabla_X Y, Z) + h(Y, \nabla^*_X Z), \tag{5} $$

for all vector fields $X, Y, Z$ on $M$. The connection $\nabla^*$ is called the dual connection, or conjugate connection, of the connection $\nabla$ (and vice versa).$^{8}$

An example of dualistic structure is, as we already said, given by the triple $(h_F, \nabla^{(\alpha)}, \nabla^{(-\alpha)})$ that one can always consider for a fixed $\alpha \in \mathbb{R}$ on a statistical manifold $S$ (provided of course that the Fisher metric and $(\pm\alpha)$-connections exist).

An important class of dualistic structures is that of dually flat structures. A dually flat structure on a manifold $M$ is a dualistic structure $(h, \nabla, \nabla^*)$ for which both connections are flat, meaning that their torsions and curvature tensors vanish. As conventions are not uniform in the literature, let us agree that the torsion $T$ and the curvature tensor $R$ of a connection $\nabla$ on $M$ are defined as

$$ \begin{aligned} T(X,Y) &:= \nabla_X Y - \nabla_Y X - [X,Y], \\ R(X,Y)Z &:= \nabla_X \nabla_Y Z - \nabla_Y \nabla_X Z - \nabla_{[X,Y]} Z, \end{aligned} \tag{6} $$

where $X, Y, Z$ are vector fields on $M$.

Given a dualistic structure $(h, \nabla, \nabla^*)$ on a manifold $M$, there exists a simple relation between the curvature tensor $R$ of $\nabla$ and the curvature tensor $R^*$ of $\nabla^*$, which is given by the following formula:

$$ h(R(X,Y)Z, W) = -h(R^*(X,Y)W, Z), \tag{7} $$

where $X, Y, Z, W$ are vector fields on $M$. From this relation, it is clear that if $\nabla$ and $\nabla^*$ are both torsion-free, then $(h, \nabla, \nabla^*)$ is dually flat if and only if $R$ or $R^*$ vanishes identically (in which case both curvature tensors vanish). In particular, since $\alpha$-connections are always torsion-free, $(h_F, \nabla^{(\alpha)}, \nabla^{(-\alpha)})$ is dually flat if and only if $R^{(\alpha)}$ or $R^{(-\alpha)}$ vanishes identically (here $R^{(\alpha)}$ denotes the curvature tensor of the $\alpha$-connection).

3 Exponential families 

Definition 3.1. An exponential family $\mathcal{E}$ on a measured space $(\Omega, dx)$ is a set of probability density functions $p(x;\theta)$ of the form

$$ p(x;\theta) = \exp\Big\{ C(x) + \sum_{i=1}^{n} \theta_i F_i(x) - \psi(\theta) \Big\}, \tag{8} $$



8 Given a connection $\nabla$ on a Riemannian manifold $(M, h)$, there exists a unique connection $\nabla^*$ on $M$ such that (5) holds; it is thus justified to call $\nabla^*$ the dual connection of $\nabla$.



where $C, F_1, \dots, F_n$ are measurable functions on $\Omega$, $\theta = (\theta_1, \dots, \theta_n)$ is a vector varying in an open subset $\Theta$ of $\mathbb{R}^n$ and where $\psi$ is a function defined on $\Theta$.

In the above definition, it is understood that if $\Omega$ is discrete, then $dx$ should be the counting measure. It is also assumed that the family $\{1, F_1, \dots, F_n\}$ is linearly independent, so that the map $p(x,\theta) \mapsto \theta \in \Theta$ becomes a bijection, hence defining a global chart of $\mathcal{E}$. The parameters $\theta_1, \dots, \theta_n$ are called the natural or canonical parameters of the exponential family $\mathcal{E}$.

Besides the natural parameters $\theta_1, \dots, \theta_n$, an exponential family $\mathcal{E}$ possesses another particularly important parametrization which is given by the expectation or dual parameters $\eta_1, \dots, \eta_n$:

$$ \eta_i(p_\theta) := E_{p_\theta}(F_i) = \int_{\Omega} F_i(x)\,p_\theta(x)\,dx. \tag{9} $$

It is not difficult, assuming $\psi$ to be smooth, to show that $\eta_i(p_\theta) = \partial_{\theta_i}\psi$. The map $\eta = (\eta_1, \dots, \eta_n)$ is thus a global chart of $\mathcal{E}$ provided that $(\partial_{\theta_1}\psi, \dots, \partial_{\theta_n}\psi) : \Theta \to \mathbb{R}^n$ is a diffeomorphism onto its image, a condition that we will always assume.

The natural and expectation parameters are important in that they form affine coordinate systems$^{9}$ with respect to $\nabla^{(e)}$ and $\nabla^{(m)}$:

Proposition 3.2. [AN00] Let $\mathcal{E}$ be an exponential family such as in (8). Then $(\mathcal{E}, h_F, \nabla^{(e)}, \nabla^{(m)})$ is dually flat and $\theta = (\theta_1, \dots, \theta_n)$ is an affine coordinate system with respect to $\nabla^{(e)}$ while $\eta = (\eta_1, \dots, \eta_n)$ is an affine coordinate system with respect to $\nabla^{(m)}$. Moreover, the following relation holds:

$$ h_F(\partial_{\theta_i}, \partial_{\eta_j}) = \delta_{ij}, \tag{10} $$

where $\delta_{ij}$ denotes the Kronecker symbol.

Let us now give some examples of exponential families, mostly taken from [AN00].

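Example 3.3 (Normal distribution). Normal distributions

$$ p(x;\mu,\sigma) = \frac{1}{\sqrt{2\pi}\,\sigma}\exp\Big\{ -\frac{(x-\mu)^2}{2\sigma^2} \Big\} \qquad (x \in \mathbb{R}), \tag{11} $$

form a 2-dimensional statistical manifold parameterized by $(\mu, \sigma) \in \mathbb{R} \times \mathbb{R}_+^*$ (here $\mathbb{R}_+^* := \{x \in \mathbb{R} \mid x > 0\}$), subsequently denoted $\mathcal{N}(\mu, \sigma^2)$. This family is easily seen to be an exponential one, for one may write

$$ p(x;\mu,\sigma) = \exp\Big\{ \frac{\mu}{\sigma^2}\,x - \frac{1}{2\sigma^2}\,x^2 - \frac{\mu^2}{2\sigma^2} - \ln\big(\sqrt{2\pi}\,\sigma\big) \Big\}, \tag{12} $$

and define

$$ \theta_1 = \frac{\mu}{\sigma^2}, \quad \theta_2 = -\frac{1}{2\sigma^2}, \quad C(x) = 0, \quad F_1(x) = x, \quad F_2(x) = x^2, \quad \psi(\theta) = -\frac{\theta_1^2}{4\theta_2} + \frac{1}{2}\ln\Big(-\frac{\pi}{\theta_2}\Big). \tag{13} $$

Example 3.4 (finite $\Omega$). For a finite set $\Omega = \{x_1, \dots, x_n\}$, define

$$ \mathcal{P}_n^{\times} := \Big\{ p : \Omega \to \mathbb{R} \;\Big|\; p(x) > 0 \text{ for all } x \in \Omega \text{ and } \sum_{k=1}^{n} p(x_k) = 1 \Big\}. \tag{14} $$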



9 Let us recall that an affine coordinate system on a manifold $M$ with a flat connection $\nabla$, or simply a $\nabla$-affine chart, is a coordinate system in which all the Christoffel symbols associated to $\nabla$ vanish.



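The space $\mathcal{P}_n^{\times}$ is clearly a statistical manifold of dimension $n-1$, and it can be turned into an exponential family by means of the following parameterization:

$$ p(x;\theta) = \exp\Big\{ \sum_{i=1}^{n-1} \theta_i F_i(x) - \psi(\theta) \Big\}, \tag{15} $$

where $x \in \Omega$, $\theta = (\theta_1, \dots, \theta_{n-1}) \in \mathbb{R}^{n-1}$, $F_i(x_j) = \delta_{ij}$ and where $\psi(\theta) = \ln\big(1 + \sum_{i=1}^{n-1} \exp(\theta_i)\big)$, the value forced by the normalization $\sum_{k=1}^{n} p(x_k;\theta) = 1$.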
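The parameterization (15) can be checked numerically. The sketch below takes $\psi(\theta) = \ln\big(1 + \sum_i \exp(\theta_i)\big)$, which is the choice that makes the probabilities in (15) sum to one, and compares the expectation parameters $\eta_i = E_{p_\theta}(F_i) = p(x_i;\theta)$ with a finite-difference gradient of $\psi$. Function names, the sample point and the step size are assumptions made for this illustration.

```python
from math import exp, log

def psi(theta):
    """psi(theta) = ln(1 + sum_i exp(theta_i)), the normalizer of (15)."""
    return log(1.0 + sum(exp(t) for t in theta))

def density(theta):
    """Values p(x_1), ..., p(x_n) from (15), using F_i(x_j) = delta_ij."""
    z = psi(theta)
    return [exp(t - z) for t in theta] + [exp(-z)]

def grad_psi(theta, eps=1e-6):
    """Central finite-difference approximation of the gradient of psi."""
    grad = []
    for i in range(len(theta)):
        up = list(theta)
        down = list(theta)
        up[i] += eps
        down[i] -= eps
        grad.append((psi(up) - psi(down)) / (2 * eps))
    return grad

theta = [0.3, -1.2, 0.5]      # arbitrary sample point in R^(n-1), here n = 4
p = density(theta)
eta = p[:-1]                  # eta_i = E[F_i] = p(x_i) for i = 1, ..., n-1
print(sum(p))                 # total probability
print(eta)
print(grad_psi(theta))        # agrees with eta up to discretization error
```

The agreement between `eta` and `grad_psi(theta)` is an instance of the relation $\eta_i = \partial_{\theta_i}\psi$ recalled after (9).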

4 Dombrowski's construction 

In this section, we explain, following Dombrowski's paper [Dom62], how the tangent bundle of a given dually flat manifold can be turned into a Kähler manifold by a simple geometric construction. This implies in particular that the tangent bundle $T\mathcal{E}$ of an exponential family is naturally a Kähler manifold.

Most of the results of this section are due to Dombrowski, except for Lemma 4.2 and subsequent corollaries, which are natural extensions of [Dom62].

Recall that if $M$ is a manifold endowed with an affine connection $\nabla$, then Dombrowski's splitting theorem holds (see [Dom62, Lan02]):

$$ T(TM) \cong TM \oplus TM \oplus TM, \tag{16} $$

this splitting being viewed as an isomorphism of vector bundles over $M$, and the isomorphism, say $\Phi$, being

$$ T_{u_x}TM \ni A_{u_x} \mapsto \big(u_x,\ \pi_{*_{u_x}} A_{u_x},\ K A_{u_x}\big), \tag{17} $$

where $\pi : TM \to M$ is the canonical projection and where $K : T(TM) \to TM$ is the canonical connector associated to the connection $\nabla$ (see [Lan02]).

Having $A_{u_x} = \Phi^{-1}\big((u_x, v_x, w_x)\big) \in T_{u_x}TM$, we shall write, for simplicity, $A_{u_x} = (u_x, v_x, w_x)$ instead of $\Phi^{-1}\big((u_x, v_x, w_x)\big)$, i.e., we will drop $\Phi$. The second component $v_x$ is usually referred to as the horizontal component of $A_{u_x}$ (with respect to the connection $\nabla$) and $w_x$ as the vertical component.

With the above notation, and provided that $M$ is endowed with a Riemannian metric $h$, it is a simple matter to define on $TM$ an almost Hermitian structure. Indeed, we define a metric $g$, a 2-form $\omega$ and an almost complex structure $J$ by setting

$$ \begin{aligned} g_{u_x}\big((u_x, v_x, w_x), (u_x, \bar{v}_x, \bar{w}_x)\big) &:= h_x(v_x, \bar{v}_x) + h_x(w_x, \bar{w}_x), \\ \omega_{u_x}\big((u_x, v_x, w_x), (u_x, \bar{v}_x, \bar{w}_x)\big) &:= h_x(v_x, \bar{w}_x) - h_x(w_x, \bar{v}_x), \\ J_{u_x}\big((u_x, v_x, w_x)\big) &:= (u_x, -w_x, v_x), \end{aligned} \tag{18} $$

where $u_x, v_x, w_x, \bar{v}_x, \bar{w}_x \in T_xM$.

Clearly, $J^2 = -\mathrm{Id}$ and $g(J\,\cdot\,, J\,\cdot\,) = g(\,\cdot\,,\,\cdot\,)$, which means that $(TM, g, J)$ is an almost Hermitian manifold, and one readily sees that $g$, $J$ and $\omega$ are compatible, i.e., that $\omega = g(J\,\cdot\,,\,\cdot\,)$; the 2-form $\omega$ is thus the fundamental 2-form of the almost Hermitian manifold $(TM, g, J)$. This is Dombrowski's construction. Observe that the map $\pi : (TM, g) \to (M, h)$ is a Riemannian submersion.
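The three algebraic identities just stated can be checked pointwise. In the sketch below a tangent vector of $TM$ at $u_x$ is represented by its pair (horizontal, vertical) of components, the base point being dropped, and $h_x$ is represented by a symmetric positive-definite matrix `H`. The representation, the names and the sample numbers are assumptions made for this illustration; only the formulas in (18) come from the text.

```python
def h(v, w, H):
    """h_x(v, w) = v^T H w for a symmetric positive-definite matrix H."""
    return sum(v[i] * H[i][j] * w[j] for i in range(len(v)) for j in range(len(w)))

def g(a, b, H):
    """Metric of (18): h(v, v') + h(w, w')."""
    (v, w), (vb, wb) = a, b
    return h(v, vb, H) + h(w, wb, H)

def omega(a, b, H):
    """2-form of (18): h(v, w') - h(w, v')."""
    (v, w), (vb, wb) = a, b
    return h(v, wb, H) - h(w, vb, H)

def J(a):
    """Almost complex structure of (18): (v, w) -> (-w, v)."""
    v, w = a
    return ([-c for c in w], list(v))

# Sample data (illustrative only).
H = [[2.0, 0.5], [0.5, 1.0]]
a = ([1.0, 2.0], [3.0, -1.0])
b = ([0.5, -2.0], [4.0, 1.0])

print(J(J(a)))                              # equals -a
print(g(J(a), J(b), H), g(a, b, H))         # equal: J is an isometry of g
print(omega(a, b, H), g(J(a), b, H))        # equal: omega = g(J . , .)
```

None of the three checks involves the connection, which only enters through the splitting (17) used to define the components.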

In [Dom62], Dombrowski shows the following:

Proposition 4.1 ([Dom62]). Let $\nabla$ be an affine connection defined on a manifold $M$, and let $J$ be the almost complex structure associated to $\nabla$ as in (18). Then,

$$ J \text{ is integrable} \quad \Longleftrightarrow \quad \nabla \text{ is flat}. \tag{19} $$




The tangent bundle $TM$ of a manifold $M$ endowed with a flat connection $\nabla$ is thus naturally a complex manifold. If in addition $M$ is equipped with a Riemannian metric $h$, then $TM$ becomes a complex Hermitian manifold for the Hermitian structure $(g, J, \omega)$ considered above.

For the 2-form $\omega$ defined in (18), we have the following result:

Lemma 4.2. Let $(M, h)$ be a Riemannian manifold endowed with a flat connection $\nabla$, and let $\omega$ be the 2-form defined as in (18). Then,

$$ d\omega = 0 \quad \Longleftrightarrow \quad T^* = 0, \tag{20} $$

where $T^*$ denotes the torsion of the dual connection $\nabla^*$.

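Proof. Let us consider a $\nabla$-affine chart $(\varphi : U \subseteq M \to \mathbb{R}^n)$ with local coordinates $\varphi = (x_1, \dots, x_n)$. In this chart, all Christoffel symbols $\Gamma_{ij}^{k}$ associated to $\nabla$ vanish. Let us also consider the chart $(\tilde{\varphi} : \tilde{U} \subseteq TM \to \mathbb{R}^n \times \mathbb{R}^n)$ with local coordinates $\tilde{\varphi} = (x_1, \dots, x_n, y_1, \dots, y_n)$ canonically associated to $(U, \varphi)$, i.e., $\tilde{U} := \pi^{-1}(U)$ ($\pi : TM \to M$ being the canonical projection) and $\tilde{\varphi}\big(\sum_{i=1}^{n} a_i\,\partial_{x_i}|_p\big) := (\varphi(p), a_1, \dots, a_n)$. In this chart, it is not hard to see that

$$ \omega(x,y) = \begin{pmatrix} 0 & h(x) \\ -h(x) & 0 \end{pmatrix}, \tag{21} $$

where $(h(x))_{ij} := h(\partial_{x_i}, \partial_{x_j})$ and where $x = (x_1, \dots, x_n)$ and $y = (y_1, \dots, y_n)$.

Now, $\omega$ is closed if and only if for all $i, j, k = 1, \dots, n$,

$$ (d\omega)(\partial_{x_i}, \partial_{x_j}, \partial_{x_k}) = 0, \qquad (d\omega)(\partial_{x_i}, \partial_{x_j}, \partial_{y_k}) = 0, \tag{22} $$

$$ (d\omega)(\partial_{x_i}, \partial_{y_j}, \partial_{y_k}) = 0, \qquad (d\omega)(\partial_{y_i}, \partial_{y_j}, \partial_{y_k}) = 0, \tag{23} $$

and it is easy, using (21), to see that the only possibly non-vanishing terms in the equations (22) and (23) are $(d\omega)(\partial_{x_i}, \partial_{x_j}, \partial_{y_k}) = \partial_{x_i} h_{jk} - \partial_{x_j} h_{ik}$, since the components of $\omega$ in (21) depend on $x$ only. Moreover, it is a simple calculation to show that

$$ \partial_{x_i} h_{jk} - \partial_{x_j} h_{ik} = h\big(T^*(\partial_{x_i}, \partial_{x_j}), \partial_{x_k}\big). \tag{24} $$

Hence, $d\omega = 0$ if and only if $T^* = 0$. The lemma follows. □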

Recall that an almost Hermitian structure $(g, J, \omega)$ on a given manifold is Kähler when the following two analytical conditions are met: (1) $J$ is integrable; (2) $d\omega = 0$. Having this in mind, Proposition 3.2, Proposition 4.1 and Lemma 4.2 readily imply the following two corollaries:

Corollary 4.3. Let $(h, \nabla, \nabla^*)$ be a dualistic structure on a manifold $M$ and let $(g, J, \omega)$ be the almost Hermitian structure on $TM$ associated to $(h, \nabla)$ via Dombrowski's construction. Then,

$$ (TM, g, J, \omega) \text{ is Kähler} \quad \Longleftrightarrow \quad (M, h, \nabla, \nabla^*) \text{ is dually flat}. \tag{25} $$

Corollary 4.4. The tangent bundle $T\mathcal{E}$ of an exponential family $\mathcal{E}$ is a Kähler manifold for the Kähler structure $(g, J, \omega)$ associated to $(h_F, \nabla^{(e)})$ via Dombrowski's construction.

In the sequel, by the Kähler structure of $T\mathcal{E}$, we shall implicitly refer to the Kähler structure of $T\mathcal{E}$ described in Corollary 4.4.
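For an exponential family, the closedness condition of Lemma 4.2 can also be seen by a direct computation in the natural coordinates $\theta$, which are $\nabla^{(e)}$-affine by Proposition 3.2. The following short derivation is only a sketch: it assumes $\psi$ smooth and uses the standard identity $(h_F)_{ij} = -E_{p_\theta}(\partial_{\theta_i}\partial_{\theta_j}\ln p_\theta)$, which is not stated in the text above.

```latex
% Assumption: C and F_i in (8) do not depend on \theta, and \psi is smooth.
\begin{align*}
\partial_{\theta_i}\partial_{\theta_j}\ln p(x;\theta)
  &= -\,\partial_{\theta_i}\partial_{\theta_j}\psi(\theta), \\
(h_F)_{ij}(\theta)
  &= -E_{p_\theta}\big(\partial_{\theta_i}\partial_{\theta_j}\ln p_\theta\big)
   = \partial_{\theta_i}\partial_{\theta_j}\psi(\theta), \\
\partial_{\theta_i}(h_F)_{jk} - \partial_{\theta_j}(h_F)_{ik}
  &= \partial_{\theta_i}\partial_{\theta_j}\partial_{\theta_k}\psi
   - \partial_{\theta_j}\partial_{\theta_i}\partial_{\theta_k}\psi = 0 .
\end{align*}
```

The last line is exactly the quantity appearing in the proof of Lemma 4.2, so it vanishes because third derivatives of $\psi$ commute, in agreement with the fact that the dual connection $\nabla^{(m)}$ is torsion-free.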



5 Kähler functions on an exponential family

Definition 5.1. Let $(N, g, J, \omega)$ be a Kähler manifold. We shall say that a function $f : N \to \mathbb{R}$ is a Kähler function if

$$ \mathscr{L}_{X_f}\, g = 0, \tag{26} $$

where $X_f$ denotes the symplectic gradient of $f$ with respect to the symplectic form $\omega$, i.e., $\omega(X_f, \,\cdot\,) = df(\,\cdot\,)$, and where $\mathscr{L}_{X_f}$ denotes the Lie derivative in the direction $X_f$.

Clearly, the Hamiltonian vector field of a Kähler function $f : N \to \mathbb{R}$ preserves the Kähler structure of $N$ in the sense that $\mathscr{L}_{X_f}\, g = 0$ and $\mathscr{L}_{X_f}\, \omega = 0$, hence the terminology. Following [CMP90], we shall also denote by $\mathscr{K}(N)$ the space of all Kähler functions defined on a Kähler manifold $N$. When $N$ has a finite number of connected components, then the space of Kähler functions on $N$ is a finite dimensional$^{10}$ Lie algebra for the natural Poisson bracket $\{f, g\} := \omega(X_f, X_g)$.

In the case of a Kähler structure associated to a dually flat manifold via Dombrowski's construction, we shall use the following terminology:

Definition 5.2. Let $(h, \nabla, \nabla^*)$ be a dually flat structure on a given manifold $M$. We shall say that a function $f : M \to \mathbb{R}$ is a Kähler function if $f \circ \pi : TM \to \mathbb{R}$ is a Kähler function with respect to the Kähler structure of $TM$ associated to $(h, \nabla)$ via Dombrowski's construction (here $\pi : TM \to M$ is the canonical projection).

We shall denote by $\mathscr{K}(M)$ the space of Kähler functions on a dually flat manifold $M$. Clearly, $\mathscr{K}(M) \subseteq \mathscr{K}(TM)$ via the map $f \mapsto f \circ \pi$.

We now want to characterize the space of Kähler functions on a dually flat manifold. To this end, recall that a vector field $X$ on a manifold $M$ is said to be $\nabla$-parallel with respect to a given connection $\nabla$ if $\nabla_Y X = 0$ for all vector fields $Y$ on $M$.

Proposition 5.3. Let $(h, \nabla, \nabla^*)$ be a dually flat structure on a manifold $M$, and let $(g, J, \omega)$ be the Kähler structure on $TM$ associated to $(h, \nabla)$ via Dombrowski's construction. For a given function $f : M \to \mathbb{R}$, we have:

\[ f \text{ is a Kähler function} \quad\Leftrightarrow\quad \mathrm{grad}^h(f) \text{ is } \nabla\text{-parallel}, \tag{27} \]

where $\mathrm{grad}^h(f)$ is the Riemannian gradient of $f$ with respect to $h$, i.e. $h(\mathrm{grad}^h(f), \,\cdot\,) = df(\,\cdot\,)$.

In order to show Proposition 5.3, we need a lemma.

Lemma 5.4. Under the hypothesis of Proposition 5.3, and using the identification given in (17), we have for $u_x \in T_xM$,

\[ (X_{f\circ\pi})_{u_x} = \big(u_x, 0, -\mathrm{grad}^h(f)_x\big) \quad\text{and}\quad \varphi_t^{X_{f\circ\pi}}(u_x) = u_x - t\,\mathrm{grad}^h(f)_x, \tag{28} \]

where $X_{f\circ\pi}$ is the symplectic gradient of $f \circ \pi : TM \to \mathbb{R}$ with respect to $\omega$ and where $\varphi_t^{X_{f\circ\pi}}$ denotes the flow of $X_{f\circ\pi}$.



10 The fact that $\mathscr{K}(N)$ is finite dimensional comes from the following result: if $(M, h)$ is a connected Riemannian manifold, then its space of Killing vector fields $\mathfrak{X}_{\mathrm{Kill}}(M) := \{X \in \mathfrak{X}(M) \mid \mathscr{L}_X h = 0\}$ is finite dimensional (see for example [Jos02]).
Proof. For $A_{u_x} \in T_{u_x}TM$, we have by definition of $\omega$ (see (15)) and of $\mathrm{grad}^h(f)$:

\[ (f\circ\pi)_{*_{u_x}} A_{u_x} = \omega\big((X_{f\circ\pi})_{u_x}, A_{u_x}\big) = h\big(\pi_{*_{u_x}}(X_{f\circ\pi})_{u_x}, K A_{u_x}\big) - h\big(\pi_{*_{u_x}} A_{u_x}, K (X_{f\circ\pi})_{u_x}\big), \tag{29} \]

\[ (f\circ\pi)_{*_{u_x}} A_{u_x} = df_x\big(\pi_{*_{u_x}} A_{u_x}\big) = h\big(\mathrm{grad}^h(f)_x, \pi_{*_{u_x}} A_{u_x}\big). \tag{30} \]

Comparing (29) and (30), we get $\pi_{*_{u_x}}(X_{f\circ\pi})_{u_x} = 0$ and $K(X_{f\circ\pi})_{u_x} = -\mathrm{grad}^h(f)_x$, which, in view of the identification given in (17), implies the first equation in (28).

The second equation in (28) is an easy consequence of the first. The lemma follows. □



Remark 5.5. A simple consequence of Lemma 5.4 is that $\mathscr{K}(M)$, viewed as a Lie subalgebra of $\mathscr{K}(TM)$, is commutative.

Proof of Proposition 5.3. In this proof, we use the identification given in (17) as well as the notation introduced in Lemma 5.4.

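Let $A_{u_x} = (u_x, v_x, w_x)$ and $\bar{A}_{u_x} = (u_x, \bar{v}_x, \bar{w}_x)$ be two tangent vectors in $T_{u_x}TM$. Since $(\pi \circ \varphi_t^{X_{f\circ\pi}})(u_x) = \pi(u_x - t\,\mathrm{grad}^h(f)_x) = x$ (see Lemma 5.4), we have $\pi_*(\varphi_t^{X_{f\circ\pi}})_{*_{u_x}} A_{u_x} = \pi_{*_{u_x}}(u_x, v_x, w_x) = v_x$ and thus, recalling the definition of $g$ given in (15),

\[ \big((\varphi_t^{X_{f\circ\pi}})^* g\big)_{u_x}(A_{u_x}, \bar{A}_{u_x}) = g\big((\varphi_t^{X_{f\circ\pi}})_{*_{u_x}} A_{u_x}, (\varphi_t^{X_{f\circ\pi}})_{*_{u_x}} \bar{A}_{u_x}\big) = h(v_x, \bar{v}_x) + h\big(K(\varphi_t^{X_{f\circ\pi}})_{*_{u_x}} A_{u_x}, K(\varphi_t^{X_{f\circ\pi}})_{*_{u_x}} \bar{A}_{u_x}\big). \tag{31} \]

We have to compute $K(\varphi_t^{X_{f\circ\pi}})_{*_{u_x}} A_{u_x}$. For this, observe that

\[ K(\varphi_t^{X_{f\circ\pi}})_{*_{u_x}} A_{u_x} = K(\varphi_t^{X_{f\circ\pi}})_{*_{u_x}} (u_x, v_x, w_x) = K(\varphi_t^{X_{f\circ\pi}})_{*_{u_x}} (u_x, v_x, 0) + K(\varphi_t^{X_{f\circ\pi}})_{*_{u_x}} (u_x, 0, w_x). \tag{32} \]

The second term in (32) is easily computed:

\[ K(\varphi_t^{X_{f\circ\pi}})_{*_{u_x}} (u_x, 0, w_x) = K \frac{d}{ds}\Big|_0 \varphi_t^{X_{f\circ\pi}}(u_x + s w_x) = K \frac{d}{ds}\Big|_0 \big(u_x + s w_x - t\,\mathrm{grad}^h(f)_x\big) = w_x. \tag{33} \]

For the first term in (32), we introduce a curve $V(s)$ in $TM$ such that $dV(s)/ds|_0 = (u_x, v_x, 0)$ and such that $V(s)$ is horizontal for all $s$. Observe that $d(\pi \circ V(s))/ds|_0 = \pi_*(u_x, v_x, w_x) = v_x$. Using this curve, we see that

\begin{align*}
K(\varphi_t^{X_{f\circ\pi}})_{*_{u_x}} (u_x, v_x, 0) &= K \frac{d}{ds}\Big|_0 \varphi_t^{X_{f\circ\pi}}(V(s)) = K \frac{d}{ds}\Big|_0 \big(V(s) - t\,\mathrm{grad}^h(f)_{\pi(V(s))}\big) \\
&= \nabla_{v_x}\big(V(s) - t\,\mathrm{grad}^h(f)_{\pi(V(s))}\big) = \nabla_{v_x} V(s) - t\,\nabla_{v_x}\mathrm{grad}^h(f)_{\pi(V(s))} \\
&= -t\,\nabla_{v_x}\mathrm{grad}^h(f)_{\pi(V(s))}. \tag{34}
\end{align*}

Now, (31), (32), (33) and (34) yield

\begin{align*}
\big((\varphi_t^{X_{f\circ\pi}})^* g\big)_{u_x}(A_{u_x}, \bar{A}_{u_x}) = {}& g(A_{u_x}, \bar{A}_{u_x}) - t\,\Big(h\big(\nabla_{v_x}\mathrm{grad}^h(f), \bar{w}_x\big) + h\big(\nabla_{\bar{v}_x}\mathrm{grad}^h(f), w_x\big)\Big) \\
& + t^2\, h\big(\nabla_{v_x}\mathrm{grad}^h(f), \nabla_{\bar{v}_x}\mathrm{grad}^h(f)\big), \tag{35}
\end{align*}

from which we clearly see that $\varphi_t^{X_{f\circ\pi}}$ is an isometry for all $t$ if and only if $\mathrm{grad}^h(f)$ is $\nabla$-parallel. The proposition follows. □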
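The passage from (31)-(34) to (35), and from (35) to the parallelism condition, can be spelled out as follows. This is only a sketch of the intermediate algebra; the shorthand $G := \mathrm{grad}^h(f)$ and $\varphi_t := \varphi_t^{X_{f\circ\pi}}$ is introduced here for brevity and is not notation from the text.

```latex
\begin{align*}
K(\varphi_t)_{*}A &= w_x - t\,\nabla_{v_x}G, \qquad
K(\varphi_t)_{*}\bar{A} = \bar{w}_x - t\,\nabla_{\bar{v}_x}G
  && \text{(by (32), (33), (34))}\\
h\big(K(\varphi_t)_{*}A,\,K(\varphi_t)_{*}\bar{A}\big)
 &= h(w_x,\bar{w}_x)
   - t\,\big(h(\nabla_{v_x}G,\bar{w}_x) + h(w_x,\nabla_{\bar{v}_x}G)\big)
   + t^2\,h(\nabla_{v_x}G,\nabla_{\bar{v}_x}G),\\
g(A,\bar{A}) &= h(v_x,\bar{v}_x) + h(w_x,\bar{w}_x).
\end{align*}
% Adding h(v_x, \bar v_x) to the second line gives (35).
% If \varphi_t is an isometry for all t, the coefficient of t must vanish for every
% choice of tangent vectors; taking \bar v_x = 0 leaves h(\nabla_{v_x} G, \bar w_x) = 0
% for all v_x and \bar w_x, hence \nabla G = 0. Conversely, \nabla G = 0 kills both the
% t and the t^2 terms.
```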
Let us now specialize to the case of an exponential family. So, let $\mathcal{E}$ be an exponential family defined on a measured space $(\Omega, dx)$ with elements of the form $p(x; \theta) = \exp\{C(x) + \sum_{i=1}^n \theta_i F_i(x) - \psi(\theta)\}$, as introduced earlier, and let us consider the following space of functions:

\[ \mathcal{A}_{\mathcal{E}} := \mathrm{Vect}_{\mathbb{R}}\{1, F_1, \ldots, F_n\}, \tag{36} \]

i.e., $\mathcal{A}_{\mathcal{E}}$ is the real vector space generated by the constant function $1$ and the functions $F_1, \ldots, F_n : \Omega \to \mathbb{R}$.

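Proposition 5.6. For a function $f : \mathcal{E} \to \mathbb{R}$, we have:

\[ f \text{ is a Kähler function} \quad\Leftrightarrow\quad \exists\, X \in \mathcal{A}_{\mathcal{E}} : f(p) = \int_\Omega X(x)\,p(x)\,dx \quad \forall\, p \in \mathcal{E}. \tag{37} \]

In order to show Proposition 5.6, we need the following lemma.

Lemma 5.7. The expectation parameters $\eta_i : \mathcal{E} \to \mathbb{R}$, $p \mapsto \int_\Omega F_i(x)\,p(x)\,dx$ satisfy the following relation:

\[ \mathrm{grad}^{h_F}(\eta_i) = \partial_{\theta_i}. \tag{38} \]

In particular, expectation parameters are Kähler functions.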

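Proof. Using the duality between the natural and expectation parameters (see (10)), one easily sees that $\partial_{\eta_i} = \sum_{k=1}^n (h_F)^{ik}\,\partial_{\theta_k}$, where the $(h_F)^{ik}$'s are the coefficients of the inverse of the matrix $(h_F)_{ij} := h_F(\partial_{\theta_i}, \partial_{\theta_j})$, and also that $h_F(\partial_{\eta_i}, \partial_{\eta_j}) = (h_F)^{ij}$. It follows that

\begin{align*}
\mathrm{grad}^{h_F}(\eta_i) &= \sum_{a,b=1}^n (h_F)_{ab}\,\frac{\partial \eta_i}{\partial \eta_a}\,\partial_{\eta_b} = \sum_{a,b=1}^n (h_F)_{ab}\,\delta_{ia}\,\partial_{\eta_b} = \sum_{b=1}^n (h_F)_{ib}\,\partial_{\eta_b} \\
&= \sum_{b,k=1}^n (h_F)_{ib}\,(h_F)^{bk}\,\partial_{\theta_k} = \sum_{k=1}^n \delta_{ik}\,\partial_{\theta_k} = \partial_{\theta_i}, \tag{39}
\end{align*}

which is the desired relation. □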
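As a sanity check of (38), here is a short worked example for the Bernoulli family. It is not taken from the text; it assumes the counting measure on $\Omega = \{0, 1\}$, $C = 0$ and $F_1(x) = x$, and uses the standard fact that the Fisher metric of an exponential family in natural coordinates is the Hessian of $\psi$.

```latex
\begin{align*}
p(x;\theta) &= \exp\{\theta x - \psi(\theta)\}, \qquad \psi(\theta) = \log(1 + e^{\theta}),\\
\eta_1(\theta) &= \sum_{x \in \{0,1\}} x\, p(x;\theta) = \frac{e^{\theta}}{1+e^{\theta}} = \psi'(\theta),\\
(h_F)_{11}(\theta) &= \psi''(\theta) = \eta_1(\theta)\,\big(1-\eta_1(\theta)\big),\\
\mathrm{grad}^{h_F}(\eta_1) &= (h_F)^{11}\,\frac{\partial \eta_1}{\partial\theta}\,\partial_\theta
 = \frac{\eta_1(1-\eta_1)}{\eta_1(1-\eta_1)}\,\partial_\theta = \partial_\theta .
\end{align*}
```

In the $\theta$-chart the gradient is the constant vector field $\partial_\theta$, which is the $\nabla^{(e)}$-parallelism required by Proposition 5.3.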

Proof of Proposition 5.6. Let $f : \mathcal{E} \to \mathbb{R}$ be a function. If $f$ is a Kähler function, then according to Proposition 5.3, $\mathrm{grad}^{h_F}(f)$ is $\nabla^{(e)}$-parallel, which means that when expressed in the $\nabla^{(e)}$-affine chart $\theta = (\theta_1, \ldots, \theta_n)$, $\mathrm{grad}^{h_F}(f)$ is a constant vector field. We can thus write $\mathrm{grad}^{h_F}(f) = \sum_{i=1}^n a_i\,\partial_{\theta_i}$, where $a_1, \ldots, a_n$ are some real constants. But then, according to Lemma 5.7 and the definition of $\eta_i$,

\begin{align*}
\mathrm{grad}^{h_F}(f) &= \sum_{i=1}^n a_i\,\partial_{\theta_i} = \sum_{i=1}^n a_i\,\mathrm{grad}^{h_F}(\eta_i) = \mathrm{grad}^{h_F}\Big(\sum_{i=1}^n a_i\,\eta_i\Big) \\
&= \mathrm{grad}^{h_F}\Big(\sum_{i=1}^n a_i \int_\Omega F_i(x)\,p(x)\,dx\Big) = \mathrm{grad}^{h_F}\Big(\int_\Omega \sum_{i=1}^n a_i F_i(x)\,p(x)\,dx\Big). \tag{40}
\end{align*}

Hence, up to an additive constant, $f(p) = \int_\Omega \sum_{i=1}^n a_i F_i(x)\,p(x)\,dx$, which shows one direction of the proposition. The other direction being trivial, the proposition follows. □

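A direct consequence of Proposition 5.6 is the following result:

Corollary 5.8. Functions of the form

\[ T\mathcal{E} \to \mathbb{R}, \qquad z \mapsto \int_\Omega X(x)\,\big[(\pi \circ \Phi)(z)\big](x)\,dx, \tag{41} \]

where $X \in \mathcal{A}_{\mathcal{E}}$ and where $\Phi : T\mathcal{E} \to T\mathcal{E}$ is a holomorphic isometry, are Kähler functions on $T\mathcal{E}$.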
6 Kählerification of an exponential family

Let $\mathcal{E}$ be an exponential family and let $(g, J, \omega)$ be the Kähler structure of $T\mathcal{E}$. We define a subgroup $\Gamma(\mathcal{E})$ of the group $\mathrm{Diff}(T\mathcal{E})$ of all diffeomorphisms of $T\mathcal{E}$ by letting

\[ \Gamma(\mathcal{E}) := \big\{\phi \in \mathrm{Diff}(T\mathcal{E}) \;\big|\; \phi^* g = g,\ \phi_* J = J \phi_* \text{ and } f \circ \phi = f \text{ for all } f \in \mathscr{K}(T\mathcal{E})\big\}. \tag{42} \]

Definition 6.1. Let $\mathcal{E}$ be an exponential family such that $\Gamma(\mathcal{E})$ is discrete and its natural action$^{11}$ on $T\mathcal{E}$ is free and proper. The quotient space $T\mathcal{E}/\Gamma(\mathcal{E})$ is then naturally a Kähler manifold for which the quotient map $T\mathcal{E} \to T\mathcal{E}/\Gamma(\mathcal{E})$ is a holomorphic Riemannian submersion. We shall call this quotient the Kählerification of $\mathcal{E}$, and use the following notation:

\[ \mathcal{E}^{\mathbb{C}} := T\mathcal{E}/\Gamma(\mathcal{E}). \tag{43} \]

At this point, it is worth mentioning that in all the examples considered in this paper, the group $\Gamma(\mathcal{E})$ is discrete and its natural action on $T\mathcal{E}$ is free and proper. In the sequel, we will always assume that the exponential families under consideration fulfill the conditions of Definition 6.1.



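Let us investigate the geometrical structure of $\mathcal{E}^{\mathbb{C}}$.

Lemma 6.2. For every $\gamma \in \Gamma(\mathcal{E})$, we have

\[ \pi \circ \gamma = \pi, \tag{44} \]

where $\pi : T\mathcal{E} \to \mathcal{E}$ is the canonical projection.

Proof. Since expectation parameters $\eta_i : \mathcal{E} \to \mathbb{R}$ are Kähler functions (see Lemma 5.7), and since $\eta = (\eta_1, \ldots, \eta_n)$ is a chart of $\mathcal{E}$, we have by definition of $\Gamma(\mathcal{E})$,

\begin{align*}
& \eta_i \circ \pi \circ \gamma = \eta_i \circ \pi \quad \text{for all } i \\
\Rightarrow\quad & \eta(\pi(\gamma(x))) = \eta(\pi(x)) \quad \text{for all } x \in T\mathcal{E} \\
\Rightarrow\quad & \pi \circ \gamma = \pi. \tag{45}
\end{align*}

This is the desired relation. □

Lemma 6.2 readily implies that the projection $\pi : T\mathcal{E} \to \mathcal{E}$ factorizes through $\mathcal{E}^{\mathbb{C}}$, yielding a submersion $\mathcal{E}^{\mathbb{C}} \to \mathcal{E}$ that we shall denote by $\pi_{\mathcal{E}}$, or simply $\pi$. A Kählerification thus has the structure of a fiber bundle induced by the submersion

\[ \pi_{\mathcal{E}} : \mathcal{E}^{\mathbb{C}} \to \mathcal{E}, \tag{46} \]

whose fiber over $p \in \mathcal{E}$ is diffeomorphic to $T_p\mathcal{E}/\Gamma(\mathcal{E})$.

Clearly, $\pi_{\mathcal{E}} : \mathcal{E}^{\mathbb{C}} \to \mathcal{E}$ is a Riemannian submersion. Also, $\mathscr{K}(T\mathcal{E}) \cong \mathscr{K}(\mathcal{E}^{\mathbb{C}})$ (Lie algebra isomorphism), and consequently there are analogues of Proposition 5.6 and Corollary 5.8 for the space of Kähler functions on $\mathcal{E}^{\mathbb{C}}$.

11 The natural action of $\Gamma(\mathcal{E})$ on $T\mathcal{E}$ is simply given by $\gamma \cdot u_x := \gamma(u_x)$, where $\gamma \in \Gamma(\mathcal{E})$ and $u_x \in T\mathcal{E}$.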
7 Kählerification of $\mathcal{P}_n^{\times}$ and complex projective spaces

Recall from Example 3.4 that $\mathcal{P}_n^{\times}$ is the space of non-vanishing probability density functions $p$ defined on a finite set $\Omega = \{x_1, \ldots, x_n\}$, i.e.,

\[ \mathcal{P}_n^{\times} := \Big\{p : \Omega \to \mathbb{R} \;\Big|\; p(x_i) > 0 \text{ for all } x_i \in \Omega \text{ and } \sum_{i=1}^n p(x_i) = 1\Big\}. \tag{47} \]

This space is clearly a connected manifold of dimension $n - 1$, and as we already saw, $\mathcal{P}_n^{\times}$ is an exponential family.

In the context of information geometry, it is customary to describe the tangent bundle of $\mathcal{P}_n^{\times}$ using the exponential representation:

\[ T_p\mathcal{P}_n^{\times} \cong \{u = (u_1, \ldots, u_n) \in \mathbb{R}^n \mid u_1 p_1 + \cdots + u_n p_n = 0\}, \tag{48} \]

where $p \in \mathcal{P}_n^{\times}$, and where by definition, $p_i := p(x_i)$ for all $x_i \in \Omega$.

If $u \in \mathbb{R}^n$ is a vector satisfying $u_1 p_1 + \cdots + u_n p_n = 0$ for a given probability density function $p : \Omega \to \mathbb{R}$, then we shall denote by $[u]_p$ the unique tangent vector of $\mathcal{P}_n^{\times}$ at the point $p$ determined by the exponential representation. One easily sees that if $p(t)$ is a smooth curve in $\mathcal{P}_n^{\times}$, then

\[ \frac{d}{dt}\Big|_0\, p(t) = [u]_{p(0)} \quad\Leftrightarrow\quad \frac{d}{dt}\Big|_0\, p_i(t) = p_i(0)\,u_i \ \text{ for all } i = 1, \ldots, n, \tag{49} \]

where $p_i(t) := (p(t))(x_i)$.

Equation (49) is actually one way to define the exponential representation.

In terms of the exponential representation, the Fisher metric $h_F$ has the following expression:

\[ (h_F)_p([u]_p, [v]_p) = \sum_{i=1}^n p_i\,u_i\,v_i, \tag{50} \]

while the covariant derivative $D[V]_{p(t)}/dt$ of a vector field $[V]_{p(t)}$ along a curve $p : I \subseteq \mathbb{R} \to \mathcal{P}_n^{\times}$ with respect to the exponential connection $\nabla^{(e)}$ is given by

\[ \frac{D}{dt}\,[V(t)]_{p(t)} = \big[\dot{V}(t) - E_{p(t)}(\dot{V}(t))\big]_{p(t)}, \tag{51} \]

where $E_{p(t)}(\dot{V}(t)) := p_1(t)\dot{V}_1(t) + \cdots + p_n(t)\dot{V}_n(t)$ is the mean of the vector $\dot{V}(t) = (\dot{V}_1(t), \ldots, \dot{V}_n(t))$ with respect to the probability $p(t)$, and where $E_{p(t)}(\dot{V}(t))$ in (51) has to be understood as the vector $(E_{p(t)}(\dot{V}(t)), \ldots, E_{p(t)}(\dot{V}(t))) \in \mathbb{R}^n$.
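The exponential representation and formula (50) can be checked numerically. The following is a minimal sketch in plain Python; the helper names (`center`, `velocity`, `fisher_exp`, `fisher_velocity`) are hypothetical and chosen only for this illustration. It compares (50) with the familiar expression $\sum_i a_i b_i / p_i$ of the Fisher metric on velocity vectors $a_i = p_i u_i$, $b_i = p_i v_i$ (cf. (49)).

```python
def center(p, u):
    """Subtract the mean E_p(u) so that u_1 p_1 + ... + u_n p_n = 0 holds."""
    mean = sum(pi * ui for pi, ui in zip(p, u))
    return [ui - mean for ui in u]

def velocity(p, u):
    """Velocity dp_i/dt = p_i * u_i of a curve with tangent vector [u]_p, cf. (49)."""
    return [pi * ui for pi, ui in zip(p, u)]

def fisher_exp(p, u, v):
    """Fisher metric in the exponential representation, cf. (50)."""
    return sum(pi * ui * vi for pi, ui, vi in zip(p, u, v))

def fisher_velocity(p, a, b):
    """Fisher metric written on velocity vectors a, b (both summing to zero)."""
    return sum(ai * bi / pi for pi, ai, bi in zip(p, a, b))

# An arbitrary point of the simplex and two tangent vectors (illustrative values).
p = [0.2, 0.3, 0.5]
u = center(p, [1.0, -2.0, 0.5])
v = center(p, [0.3, 0.7, -1.1])
a, b = velocity(p, u), velocity(p, v)
```

The constraint in (48) is exactly what makes the velocity `a` sum to zero, so that the curve stays inside the simplex to first order.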

For later purposes, let us also give the following result, which gives an explicit description of the inverse of the map $\Phi : T(T\mathcal{P}_n^{\times}) \to T\mathcal{P}_n^{\times} \oplus T\mathcal{P}_n^{\times} \oplus T\mathcal{P}_n^{\times}$ introduced in (17). The proof may be found in [Molb].

Lemma 7.1. For $[u]_p, [v]_p, [w]_p \in T_p\mathcal{P}_n^{\times}$, we have:

\[ \Phi^{-1}([u]_p, [v]_p, [w]_p) = \frac{d}{dt}\Big|_0\, \big[u + tw - E_{p(t)}(u + tw)\big]_{p(t)}, \tag{52} \]

where $p(t)$ is a smooth curve in $\mathcal{P}_n^{\times}$ satisfying $p(0) = p$ and $dp(t)/dt|_0 = [v]_p$.
We now want to relate the natural Kähler structure of $T\mathcal{P}_n^{\times}$ to the Kähler structure of the complex projective space $\mathbb{P}(\mathbb{C}^n)$. To this end, and for the reader's convenience, let us digress a little on $\mathbb{P}(\mathbb{C}^n)$.

Recall that the complex projective space $\mathbb{P}(\mathbb{C}^n)$ is the quotient $(\mathbb{C}^n - \{0\})/\!\sim$, where the equivalence relation "$\sim$" is defined by

\[ (z_1, \ldots, z_n) \sim (w_1, \ldots, w_n) \quad\Leftrightarrow\quad \exists\, \lambda \in \mathbb{C} - \{0\} : (z_1, \ldots, z_n) = \lambda\,(w_1, \ldots, w_n). \tag{53} \]

For $z = (z_1, \ldots, z_n) \in \mathbb{C}^n - \{0\}$, we shall denote by $[z] = [z_1, \ldots, z_n]$ the corresponding element of $\mathbb{P}(\mathbb{C}^n)$. One may identify $[z]$ with the complex line $\mathbb{C} \cdot z$.

The manifold structure of $\mathbb{P}(\mathbb{C}^n)$ may be defined as follows. For a vector $u = (u_1, \ldots, u_n) \in \mathbb{C}^n$ such that $|u|^2 = \langle u, u \rangle = \bar{u}_1 u_1 + \cdots + \bar{u}_n u_n = 1$ (our convention for the Hermitian product $\langle\,,\rangle$ on $\mathbb{C}^n$ is that $\langle\,,\rangle$ is linear in the second argument), we define a chart $(U_u, \phi_u)$ of $\mathbb{P}(\mathbb{C}^n)$ by letting

\begin{align*}
U_u &:= \{[z] \in \mathbb{P}(\mathbb{C}^n) \mid [u]^{\perp} \cap [z] = \{0\}\}, \\
\phi_u &: U_u \to [u]^{\perp} \subseteq \mathbb{C}^n, \qquad [z] \mapsto \frac{1}{\langle u, z \rangle}\, z - u. \tag{54}
\end{align*}

If $u$ varies among all the unit vectors in $\mathbb{C}^n$, then the corresponding charts $(U_u, \phi_u)$ form a holomorphic atlas for $\mathbb{P}(\mathbb{C}^n)$; the projective space is thus a manifold of real dimension $2(n - 1)$, and using the above charts we have the identification

\[ T_{[u]}\mathbb{P}(\mathbb{C}^n) \cong [u]^{\perp} = \{w \in \mathbb{C}^n \mid \langle u, w \rangle = 0\}. \tag{55} \]

The Fubini-Study metric $g_{FS}$ and the Fubini-Study symplectic form $\omega_{FS}$ are now defined at the point $[u]$ in $\mathbb{P}(\mathbb{C}^n)$ via the formulas:

\[ \big((\phi_u^{-1})^* g_{FS}\big)_0(\xi_1, \xi_2) := \mathrm{Re}\,\langle \xi_1, \xi_2 \rangle, \qquad \big((\phi_u^{-1})^* \omega_{FS}\big)_0(\xi_1, \xi_2) := \mathrm{Im}\,\langle \xi_1, \xi_2 \rangle, \tag{56} \]

where $\xi_1, \xi_2 \in [u]^{\perp} \cong T_{[u]}\mathbb{P}(\mathbb{C}^n)$ and where "Re" and "Im" stand for the real and imaginary parts.

One may show that $g_{FS}$ and $\omega_{FS}$ are globally well defined on $\mathbb{P}(\mathbb{C}^n)$, and that $(g_{FS}, J_{FS}, \omega_{FS})$ is a Kähler structure on $\mathbb{P}(\mathbb{C}^n)$, where $J_{FS}$ denotes the natural complex structure of $\mathbb{P}(\mathbb{C}^n)$.

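Now, consider the map

\[ \tau : T\mathcal{P}_n^{\times} \to \mathbb{P}(\mathbb{C}^n)^{\times}, \qquad [u]_p \mapsto \big[\sqrt{p_1}\,e^{iu_1/2}, \ldots, \sqrt{p_n}\,e^{iu_n/2}\big], \tag{57} \]

where $\mathbb{P}(\mathbb{C}^n)^{\times}$ is the open subset of $\mathbb{P}(\mathbb{C}^n)$ defined by

\[ \mathbb{P}(\mathbb{C}^n)^{\times} := \{[z_1, \ldots, z_n] \in \mathbb{P}(\mathbb{C}^n) \mid z_i \neq 0 \text{ for all } i = 1, \ldots, n\}. \tag{58} \]

In [Molb], the following is shown:

Proposition 7.2 ([Molb]).

(i) The map $\tau : T\mathcal{P}_n^{\times} \to \mathbb{P}(\mathbb{C}^n)^{\times}$ is a universal covering map whose deck transformation group is isomorphic to $\mathbb{Z}^{n-1}$,

(ii) the following relations hold:

\[ \tau^* g_{FS} = \tfrac{1}{4}\, g, \qquad \tau^* \omega_{FS} = \tfrac{1}{4}\, \omega, \qquad \tau_* J = J_{FS}\, \tau_*, \tag{59} \]

where $(g, J, \omega)$ is the Kähler structure of $T\mathcal{P}_n^{\times}$ associated to $(h_F, \nabla^{(e)})$ via Dombrowski's construction.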
Remark 7.3. In [Molb], the Fisher metric $h_F$ was defined as the one considered in this paper, multiplied by a factor 1/4. Because of that, the first two formulas in (59) differ from the corresponding formulas in [Molb] by a factor 1/4.

Remark 7.4. Observe that every deck transformation of $T\mathcal{P}_n^{\times}$ has to be a holomorphic isometry.

We can now state the main result of this section.

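Proposition 7.5. The group $\Gamma(\mathcal{P}_n^{\times})$ coincides with the deck transformation group of the universal covering map $\tau : T\mathcal{P}_n^{\times} \to \mathbb{P}(\mathbb{C}^n)^{\times}$. In particular, if we multiply both the Fubini-Study metric $g_{FS}$ and the Fubini-Study symplectic form $\omega_{FS}$ by a factor 4, then we get a natural identification of Kähler manifolds:

\[ (\mathcal{P}_n^{\times})^{\mathbb{C}} \cong \mathbb{P}(\mathbb{C}^n)^{\times}. \tag{60} \]

Moreover, in terms of the above identification, the canonical projection $\pi_{\mathcal{P}_n^{\times}} : (\mathcal{P}_n^{\times})^{\mathbb{C}} \to \mathcal{P}_n^{\times}$ becomes

\[ \pi_{\mathcal{P}_n^{\times}} : \mathbb{P}(\mathbb{C}^n)^{\times} \to \mathcal{P}_n^{\times}, \qquad \pi_{\mathcal{P}_n^{\times}}([z])(x_k) := \frac{\bar{z}_k z_k}{\langle z, z \rangle}. \tag{61} \]

We will show Proposition 7.5 with a series of lemmas.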
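The maps (57) and (61) are easy to experiment with. Below is a minimal sketch in plain Python, with hypothetical helper names (`tau`, `proj`, `same_point`, `center`). It checks that the projection (61) recovers $p$ from $\tau([u]_p)$, and it exhibits one family of shifts of $u$ that leave $\tau([u]_p)$ unchanged. The explicit shift $u \mapsto u + 4\pi k - E_p(4\pi k)$, with $k$ an integer vector, is an assumption made for illustration; it is consistent with the group $\mathbb{Z}^{n-1}$ of Proposition 7.2 but is not a formula taken from the text.

```python
import cmath
import math

def center(p, u):
    """Subtract the mean E_p(u) so that u_1 p_1 + ... + u_n p_n = 0 holds."""
    mean = sum(pk * uk for pk, uk in zip(p, u))
    return [uk - mean for uk in u]

def tau(p, u):
    """Homogeneous coordinates sqrt(p_k) * exp(i u_k / 2) of tau([u]_p), cf. (57)."""
    return [math.sqrt(pk) * cmath.exp(0.5j * uk) for pk, uk in zip(p, u)]

def proj(z):
    """pi([z])(x_k) = |z_k|^2 / <z, z>, cf. (61); independent of the representative z."""
    norm2 = sum(abs(zk) ** 2 for zk in z)
    return [abs(zk) ** 2 / norm2 for zk in z]

def same_point(z, w, tol=1e-12):
    """[z] = [w] exactly when Cauchy-Schwarz is an equality: |<z,w>|^2 = <z,z><w,w>."""
    inner = sum(zk.conjugate() * wk for zk, wk in zip(z, w))
    nz = sum(abs(zk) ** 2 for zk in z)
    nw = sum(abs(wk) ** 2 for wk in w)
    return abs(abs(inner) ** 2 - nz * nw) < tol

p = [0.1, 0.6, 0.3]
u = center(p, [0.4, -1.3, 2.0])
z = tau(p, u)

# Shift each u_k by 4*pi*k_k, then re-center: every phase changes by the same factor.
k = [1, 0, -2]
u_shift = center(p, [uk + 4 * math.pi * kk for uk, kk in zip(u, k)])
w = tau(p, u_shift)

# A generic different tangent vector gives a different point of the projective space.
other = tau(p, center(p, [0.0, 1.0, 0.0]))
```

Adding the same constant to every $k_j$ changes nothing after re-centering, which is why only $n - 1$ independent integers survive.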



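Lemma 7.6. Let $\mathscr{K}(\mathbb{P}(\mathbb{C}^n))$ be the space of Kähler functions on $\mathbb{P}(\mathbb{C}^n)$ and let $\mathfrak{u}(n)$ be the space of complex $n \times n$ skew-Hermitian matrices. If $\mathscr{K}(\mathbb{P}(\mathbb{C}^n))$ is endowed with its natural Poisson bracket $\{f, g\} := \omega_{FS}(X_f, X_g)$, then the map $\mathfrak{u}(n) \to \mathscr{K}(\mathbb{P}(\mathbb{C}^n))$, $A \mapsto \xi^A$, where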

e A (W):=^^^, (zeC"-{0}) (62) 

is a Lie algebra isomorphism. 

Proof. See for example [CMP90]. □

Lemma 7.7. Let $[z], [w]$ be two points in $\mathbb{P}(\mathbb{C}^n)$. We have:

\[ [z] = [w] \quad\Leftrightarrow\quad f([z]) = f([w]) \ \text{ for all } f \in \mathscr{K}(\mathbb{P}(\mathbb{C}^n)). \tag{63} \]

Proof. Let $z, w \in \mathbb{C}^n - \{0\}$ be two vectors. Using Lemma 7.6, and especially (62), it is easy to see that if $f([z]) = f([w])$ for all Kähler functions $f$ on $\mathbb{P}(\mathbb{C}^n)$, then

\[ 2\,\mathrm{Re}\Big(\sum_{a<b} A_{ab}\,(\bar{z}_a z_b - \bar{w}_a w_b)\Big) + \sum_{a} A_{aa}\,(|z_a|^2 - |w_a|^2) = 0, \tag{64} \]

where the $A_{ab}$ ($a, b = 1, \ldots, n$, $a < b$) are arbitrary complex numbers, and where the $A_{aa}$ ($a = 1, \ldots, n$) are arbitrary real numbers (one may think of $A_{ab}$ as the coefficients of a Hermitian matrix). From (64), we deduce that

\[ \bar{z}_a z_b = \bar{w}_a w_b \tag{65} \]

for all $a, b = 1, \ldots, n$. By introducing polar decompositions and with some algebraic manipulations, it is then easy to see that $z$ and $w$ are collinear. The lemma follows. □

Lemma 7.8. A function $f : T\mathcal{P}_n^{\times} \to \mathbb{R}$ is a Kähler function if and only if there exists a Kähler function $\hat{f} : \mathbb{P}(\mathbb{C}^n) \to \mathbb{R}$ such that

\[ f = \hat{f} \circ \tau. \tag{66} \]
Proof. Let $f : T\mathcal{P}_n^{\times} \to \mathbb{R}$ be a Kähler function. Since $\tau : T\mathcal{P}_n^{\times} \to \mathbb{P}(\mathbb{C}^n)^{\times}$ is a covering map, for every $z \in T\mathcal{P}_n^{\times}$ there exists an open and connected set $U_z \subseteq T\mathcal{P}_n^{\times}$ containing $z$ and such that the restriction of $\tau$ to $U_z$ is a diffeomorphism between $U_z$ and $\tau(U_z)$. Let us denote this restriction by $\tau|_{U_z}$. According to (ii) in Proposition 7.2, $\tau|_{U_z}$ is a holomorphic isometry; this implies that the map $f \circ (\tau|_{U_z})^{-1} : \tau(U_z) \to \mathbb{R}$ is a Kähler function, which means in particular that $X_{f \circ (\tau|_{U_z})^{-1}}$ is a Killing vector field on $\tau(U_z)$. But now, since $\mathbb{P}(\mathbb{C}^n)$ is a connected, simply connected and complete (in the Riemannian sense) Kähler manifold, there exists a Killing vector field on $\mathbb{P}(\mathbb{C}^n)$ extending $X_{f \circ (\tau|_{U_z})^{-1}}$ (see [Nom60]), and this Killing vector field is bound$^{12}$ to be a Hamiltonian vector field $X_{\hat{f}_{U_z}}$ for some Kähler function $\hat{f}_{U_z} : \mathbb{P}(\mathbb{C}^n) \to \mathbb{R}$, the latter being defined only up to an additive constant. By choosing this constant appropriately, we thus get a Kähler function $\hat{f}_{U_z}$ such that

\[ f|_{U_z} = (\hat{f}_{U_z} \circ \tau)|_{U_z}. \tag{67} \]

The above formula shows that the statement in the lemma is locally true. Let us now show that it is also globally true. So let $U, V \subseteq T\mathcal{P}_n^{\times}$ be two connected open sets whose intersection is not empty, and such that there exist two Kähler functions $\hat{f}_U, \hat{f}_V : \mathbb{P}(\mathbb{C}^n) \to \mathbb{R}$ verifying $f|_U = (\hat{f}_U \circ \tau)|_U$ and $f|_V = (\hat{f}_V \circ \tau)|_V$. Since $\hat{f}_U \circ \tau$ and $\hat{f}_V \circ \tau$ coincide on the intersection of $U$ and $V$, there exists a connected open subset of $\mathbb{P}(\mathbb{C}^n)$ on which $\hat{f}_U$ and $\hat{f}_V$ coincide. But now, since Kähler functions on $\mathbb{P}(\mathbb{C}^n)$ are of the form given in (62) (see Lemma 7.6), it is clear that if two Kähler functions on the complex projective space coincide on an open subset, then they are equal. This implies $\hat{f}_U = \hat{f}_V$, from which the lemma follows. □



Since every deck transformation of $T\mathcal{P}_n^{\times}$ is a holomorphic isometry, it follows from Lemma 7.8 that the deck transformation group of the universal covering map $\tau : T\mathcal{P}_n^{\times} \to \mathbb{P}(\mathbb{C}^n)^{\times}$ is a subgroup of $\Gamma(\mathcal{P}_n^{\times})$. The converse is given by the following lemma.

Lemma 7.9. An element $\gamma \in \Gamma(\mathcal{P}_n^{\times})$ is necessarily a deck transformation.

Proof. Let $\gamma$ be an element of $\Gamma(\mathcal{P}_n^{\times})$. By definition of $\Gamma(\mathcal{P}_n^{\times})$, and taking into account Lemma 7.7 and Lemma 7.8, we see that

\[ \big(f \circ \tau \circ \gamma = f \circ \tau \quad \forall\, f \in \mathscr{K}(\mathbb{P}(\mathbb{C}^n))\big) \quad\Rightarrow\quad \tau \circ \gamma = \tau, \tag{68} \]

i.e., $\gamma$ is a deck transformation. The lemma follows. □

Equation (61) being straightforward, Proposition 7.5 is now a direct consequence of Lemma 7.8 and Lemma 7.9.

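Let us end this section with a few important remarks on the map $\pi_{\mathcal{P}_n^{\times}} : \mathbb{P}(\mathbb{C}^n)^{\times} \to \mathcal{P}_n^{\times}$. Clearly, $\pi_{\mathcal{P}_n^{\times}}$ extends uniquely as a continuous map $\pi_{\mathcal{P}_n} : \mathbb{P}(\mathbb{C}^n) \to \mathcal{P}_n$, where$^{13}$

\[ \mathcal{P}_n := \Big\{p : \Omega \to \mathbb{R} \;\Big|\; p(x_i) \geq 0 \text{ for all } x_i \in \Omega \text{ and } \sum_{i=1}^n p(x_i) = 1\Big\}. \tag{69} \]

Notice that $\mathcal{P}_n^{\times} \subseteq \mathcal{P}_n$, and that these two spaces are distinguished only by the conditions $p > 0$ for $\mathcal{P}_n^{\times}$ and $p \geq 0$ for $\mathcal{P}_n$. Notice also that $\mathcal{P}_n$ is not a smooth manifold, for it has a boundary and corners.

12 It is well known that every Killing vector field on $\mathbb{P}(\mathbb{C}^n)$ can be realized as the Hamiltonian vector field of an appropriate Kähler function.

13 We endow $\mathcal{P}_n$ with the topology induced by $\mathbb{R}^n$ via the injection $\mathcal{P}_n \hookrightarrow \mathbb{R}^n$, $p \mapsto (p(x_1), \ldots, p(x_n))$.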
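The map $\pi_{\mathcal{P}_n} : \mathbb{P}(\mathbb{C}^n) \to \mathcal{P}_n$ has the property that it makes the following diagram commutative

\[ \begin{array}{ccc}
\mathbb{P}(\mathbb{C}^n)^{\times} & \xrightarrow{\ \pi_{\mathcal{P}_n^{\times}}\ } & \mathcal{P}_n^{\times} \\
{\scriptstyle i}\downarrow & & \downarrow{\scriptstyle j} \\
\mathbb{P}(\mathbb{C}^n) & \xrightarrow{\ \pi_{\mathcal{P}_n}\ } & \mathcal{P}_n
\end{array} \tag{70} \]

($i, j$ are inclusions), which allows us to carry over many structural properties of the space of Kähler functions on $\mathbb{P}(\mathbb{C}^n)^{\times}$ to the space of Kähler functions on $\mathbb{P}(\mathbb{C}^n)$. Indeed, let $\mathscr{K}(\mathcal{P}_n)$ denote the following space of functions:

\[ \Big\{f : \mathcal{P}_n \to \mathbb{R} \;\Big|\; f(p) = \sum_{k=1}^n X_k\, p(x_k), \ X = (X_1, \ldots, X_n) \in \mathbb{R}^n\Big\}. \tag{71} \]

By using Corollary 5.8, Lemma 7.8, some obvious continuity arguments and the fact that $\mathcal{A}_{\mathcal{P}_n^{\times}} = \{X : \Omega \to \mathbb{R}\} \cong \mathbb{R}^n$ (see the general definition of $\mathcal{A}_{\mathcal{E}}$ given in (36) and Example 3.4), one easily shows the following:

Proposition 7.10. We have:

(i) $\mathscr{K}(\mathbb{P}(\mathbb{C}^n)^{\times}) \cong \mathscr{K}(\mathbb{P}(\mathbb{C}^n))$ (Lie algebra isomorphism),

(ii) $\mathscr{K}(\mathcal{P}_n^{\times}) \cong \mathscr{K}(\mathcal{P}_n)$,

(iii) functions on $\mathbb{P}(\mathbb{C}^n)$ of the form $f \circ \pi_{\mathcal{P}_n} \circ \Phi$, where $\Phi : \mathbb{P}(\mathbb{C}^n) \to \mathbb{P}(\mathbb{C}^n)$ is a holomorphic isometry and where $f \in \mathscr{K}(\mathcal{P}_n)$, are Kähler functions.

We see from Proposition 7.10 that the map $\pi_{\mathcal{P}_n} : \mathbb{P}(\mathbb{C}^n) \to \mathcal{P}_n$ behaves like the canonical projection of a Kählerification. It is thus a natural "completion" of the map $\pi_{\mathcal{P}_n^{\times}} : \mathbb{P}(\mathbb{C}^n)^{\times} \to \mathcal{P}_n^{\times}$, and formally we have $(\mathcal{P}_n)^{\mathbb{C}} = \mathbb{P}(\mathbb{C}^n)$.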



8 Kählerification and the geometrical formulation of quantum mechanics

The goal of this section is to rederive the geometrical formulation of quantum mechanics in finite dimension (based on the Kähler properties of $\mathbb{P}(\mathbb{C}^n)$ as in [AS99]), using a statistically oriented approach through the equation $(\mathcal{P}_n^{\times})^{\mathbb{C}} = \mathbb{P}(\mathbb{C}^n)^{\times}$ and its formal version $(\mathcal{P}_n)^{\mathbb{C}} = \mathbb{P}(\mathbb{C}^n)$.

This exercise is necessary, for we want to express all the relevant quantities of the geometrical formulation in terms of statistical concepts, aiming to generalize them to situations where $\mathcal{P}_n^{\times}$ is replaced by a more general exponential family $\mathcal{E}$.

Let us start with the following "statistical" characterization of Kähler functions on the complex projective space (see also Corollary 5.8).

Proposition 8.1. Let $f : \mathbb{P}(\mathbb{C}^n) \to \mathbb{R}$ be a smooth function. Then, $f$ is a Kähler function if and only if there exist a random variable $X : \Omega = \{x_1, \ldots, x_n\} \to \mathbb{R}$ and a unitary matrix $U \in U(n)$ such that

\[ f([z]) = \int_\Omega X(x)\,\big[(\pi_{\mathcal{P}_n} \circ \Phi_U)([z])\big](x)\,dx, \tag{72} \]

where $\pi_{\mathcal{P}_n} : \mathbb{P}(\mathbb{C}^n) \to \mathcal{P}_n$ is the map considered at the end of §7 and where $\Phi_U$ is the holomorphic isometry of $\mathbb{P}(\mathbb{C}^n)$ defined by $\Phi_U([z]) = [U \cdot z]$.
Proof. Simply use Lemma 7.6 together with the usual spectral decomposition theorem. □

Proposition 8.1 implies that every Kähler function on $\mathbb{P}(\mathbb{C}^n)$ can be realized as an expectation of the form $f([z]) = E_{(\pi_{\mathcal{P}_n} \circ \Phi_U)([z])}(X)$, where $X : \Omega \to \mathbb{R}$ is a random variable. The image $\mathrm{Im}(X)$ of $X$ has thus an important statistical meaning that we would like to relate with the usual spectrum of $f$ as defined in the geometrical formulation of quantum mechanics.
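The role of the spectral decomposition can be illustrated for $n = 2$. The sketch below, in plain Python with hypothetical helper names, evaluates the expectation (72) (with $dx$ the counting measure) and compares it with the quotient $\langle z, Hz \rangle / \langle z, z \rangle$ for the Hermitian matrix $H = U^* \mathrm{diag}(X)\, U$. The normalization relating $H$ to the skew-Hermitian matrix $A$ of Lemma 7.6 is not used here, and the matrix $U$ and the values of $X$ are arbitrary illustrative choices.

```python
import cmath
import math

def inner(z, w):
    """Hermitian product on C^n, linear in the second argument."""
    return sum(a.conjugate() * b for a, b in zip(z, w))

def matvec(M, z):
    return [sum(M[i][j] * z[j] for j in range(len(z))) for i in range(len(M))]

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def dagger(M):
    """Conjugate transpose."""
    n = len(M)
    return [[M[j][i].conjugate() for j in range(n)] for i in range(n)]

def proj(z):
    """pi([z])(x_k) = |z_k|^2 / <z, z>."""
    norm2 = inner(z, z).real
    return [abs(a) ** 2 / norm2 for a in z]

def expectation(X, U, z):
    """f([z]) = sum_k X(x_k) * pi(Phi_U([z]))(x_k), i.e. (72) for the counting measure."""
    return sum(x * q for x, q in zip(X, proj(matvec(U, z))))

def rayleigh(H, z):
    """<z, H z> / <z, z> for a Hermitian matrix H (real-valued)."""
    return (inner(z, matvec(H, z)) / inner(z, z)).real

t, phase = 0.7, cmath.exp(0.3j)
U = [[math.cos(t), -math.sin(t) * phase],
     [math.sin(t) * phase.conjugate(), math.cos(t)]]  # a unitary 2x2 matrix
X = [1.5, -0.5]                                        # values of the random variable
H = matmul(dagger(U), matmul([[X[0], 0.0], [0.0, X[1]]], U))
z = [0.3 + 1.1j, -0.8 + 0.2j]
```

Evaluating `expectation` at the columns of `dagger(U)` returns the entries of `X`, which is the elementary picture behind the eigenvalues and eigenpoints discussed later in this section.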

To this end, we could use Lemma 7.6 and Proposition 8.1, and relate $\mathrm{Im}(X)$ with the usual spectrum of an appropriate Hermitian matrix. We prefer, however, to use a generalization of a statistical result that we now present.

Recall that if $X : \Omega \to \mathbb{R}$ is a random variable, and if $p \in \mathcal{P}_n$ is a given probability, then the variance of $X$ with respect to $p$ is given by $V_p(X) = E_p\big((X - E_p(X))^2\big)$.

Proposition 8.2 (Cramér-Rao equality). Let $f([z]) = \int_\Omega X(x)\,[(\pi_{\mathcal{P}_n} \circ \Phi_U)([z])](x)\,dx$ be a Kähler function on $\mathbb{P}(\mathbb{C}^n)$. For all $[z] \in \mathbb{P}(\mathbb{C}^n)$, we have:

\[ V_{(\pi_{\mathcal{P}_n} \circ \Phi_U)([z])}(X) = \tfrac{1}{4}\,\|\mathrm{grad}(f)_{[z]}\|^2, \tag{73} \]

where $\mathrm{grad}(f)$ denotes the Riemannian gradient of $f$ with respect to the Fubini-Study metric $g_{FS}$.

Proof. If $f : \mathbb{P}(\mathbb{C}^n) \to \mathbb{R}$ is a (not necessarily Kähler) function, and if $\phi : \mathbb{P}(\mathbb{C}^n) \to \mathbb{P}(\mathbb{C}^n)$ is an isometry, then, for $[z] \in \mathbb{P}(\mathbb{C}^n)$, we have:

\[ \mathrm{grad}(f \circ \phi)_{[z]} = (\phi^{-1})_*\,\mathrm{grad}(f)_{\phi([z])}. \tag{74} \]

From this formula, and the fact that $\Phi_U$ is an isometry, we see that it is sufficient to show the proposition for $U = I_n$. So let $X : \Omega \to \mathbb{R}$ be a random variable and assume that $f([z]) = \sum_{k=1}^n X(x_k)\,\pi_{\mathcal{P}_n}([z])(x_k)$. Taking into account Proposition 7.2, we see that the function $\tilde{f} := f \circ \tau : T\mathcal{P}_n^{\times} \to \mathbb{R}$ satisfies

\[ \tau_{*_{[u]_p}}\,\mathrm{grad}(\tilde{f})_{[u]_p} = \tfrac{1}{4}\,\mathrm{grad}(f)_{\tau([u]_p)} \quad\text{and}\quad \|\mathrm{grad}(\tilde{f})_{[u]_p}\|^2 = \tfrac{1}{4}\,\|\mathrm{grad}(f)_{\tau([u]_p)}\|^2, \tag{75} \]

where $\mathrm{grad}(\tilde{f})$ denotes the Riemannian gradient of $\tilde{f}$ with respect to the Riemannian metric $g$ (see the definition of $g$ in Proposition 7.2). Moreover, for $A_{[u]_p} = ([u]_p, [v]_p, [w]_p) = \frac{d}{dt}\big|_0\,[u + tw - E_{p(t)}(u + tw)]_{p(t)}$ as in Lemma 7.1, and taking into account the relation $(\pi_{\mathcal{P}_n} \circ \tau)([u]_p) = p$, we see that

\begin{align*}
g_{[u]_p}\big(\mathrm{grad}(\tilde{f})_{[u]_p}, A_{[u]_p}\big) &= \tilde{f}_{*_{[u]_p}} A_{[u]_p} = \frac{d}{dt}\Big|_0\, \tilde{f}\big([u + tw - E_{p(t)}(u + tw)]_{p(t)}\big) \\
&= \frac{d}{dt}\Big|_0 \sum_{k=1}^n p(t)(x_k)\,X(x_k) = \sum_{k=1}^n p(x_k)\,v_k\,X(x_k) \\
&= (h_F)_p\big([v]_p, [X - E_p(X)]_p\big) + (h_F)_p\big([w]_p, 0\big) \\
&= g_{[u]_p}\big(([u]_p, [X - E_p(X)]_p, [0]_p),\ ([u]_p, [v]_p, [w]_p)\big), \tag{76}
\end{align*}

and thus,

\[ \mathrm{grad}(\tilde{f})_{[u]_p} = \big([u]_p, [X - E_p(X)]_p, [0]_p\big). \tag{77} \]

From this equation, and taking into account (50), we get

\[ \|\mathrm{grad}(\tilde{f})_{[u]_p}\|^2 = E_p\big((X - E_p(X))^2\big) = V_p(X). \tag{78} \]

The proposition is now a consequence of this last equation together with (75) and the fact that $\tau(T\mathcal{P}_n^{\times})$ is dense in $\mathbb{P}(\mathbb{C}^n)$. □
Remark 8.3. Proposition 8.2 is a direct generalization of a formula which is well known in the context of information geometry, namely$^{14}$

\[ \|\mathrm{grad}(E_p(X))\|^2 = V_p(X), \tag{79} \]

where $E_p(X)$ denotes the function $\mathcal{P}_n^{\times} \to \mathbb{R}$, $p \mapsto E_p(X)$ ($X : \Omega \to \mathbb{R}$ being a given random variable), and where the norm and the Riemannian gradient are taken with respect to the Fisher metric $h_F$. The above formula is sometimes called the Cramér-Rao equality, for it allows one to recover the usual Cramér-Rao inequality, the latter being, roughly, an inequality which gives a "lower bound" for the variance-covariance matrix of an unbiased estimator on a given statistical model $S$ (see [AN00] for details).
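Formula (79) can be tested numerically on a small simplex. The sketch below uses plain Python and hypothetical helper names. It takes as an assumption, suggested by (76) and (77), that the exponential representation of $\mathrm{grad}(E_p(X))$ is the centered variable $X - E_p(X)$; it then checks this against a finite-difference directional derivative along the curve $p_i(t) \propto p_i\,e^{t u_i}$, and checks that the squared norm (50) of that vector is the variance.

```python
import math

def mean(p, X):
    return sum(pi * xi for pi, xi in zip(p, X))

def variance(p, X):
    m = mean(p, X)
    return sum(pi * (xi - m) ** 2 for pi, xi in zip(p, X))

def fisher(p, u, v):
    """Fisher metric in the exponential representation, cf. (50)."""
    return sum(pi * ui * vi for pi, ui, vi in zip(p, u, v))

def grad_mean(p, X):
    """Candidate exponential representation of grad(E_p(X)): X - E_p(X)."""
    m = mean(p, X)
    return [xi - m for xi in X]

def curve(p, u, t):
    """Curve p_i(t) proportional to p_i * exp(t * u_i); velocity p_i (u_i - E_p(u)) at t = 0."""
    w = [pi * math.exp(t * ui) for pi, ui in zip(p, u)]
    s = sum(w)
    return [wi / s for wi in w]

p = [0.2, 0.3, 0.5]
X = [1.0, 4.0, -2.0]
raw = [0.5, -1.0, 0.2]
m_raw = mean(p, raw)
u = [ri - m_raw for ri in raw]          # centered tangent direction

g = grad_mean(p, X)
eps = 1e-6
# Central finite difference of t -> E_{p(t)}(X) at t = 0.
directional = (mean(curve(p, u, eps), X) - mean(curve(p, u, -eps), X)) / (2 * eps)
```

Here `fisher(p, g, u)` plays the role of $h_F(\mathrm{grad}(E_p(X)), [u]_p)$ and `fisher(p, g, g)` that of the left-hand side of (79).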

Remark 8.4. More generally, if $\mathcal{E}$ is an exponential family whose elements are of the form $p(x; \theta) = \exp\{C(x) + \sum_{i=1}^n \theta_i F_i(x) - \psi(\theta)\}$ on a fixed measured space $(\Omega, dx)$ (see the definition of an exponential family given earlier), and if $X : \Omega \to \mathbb{R}$ is a linear combination of $1, F_1, \ldots, F_n$, then one can easily show the following identity:

\[ \|\mathrm{grad}(E_p(X))\|^2 = V_p(X), \tag{80} \]

where $E_p(X)$ is viewed as the function $\mathcal{E} \to \mathbb{R}$, $p \mapsto E_p(X)$, and where the norm and the gradient are taken with respect to the Fisher metric $h_F$.

By inspection of the Cramér-Rao equality (as formulated in Proposition 8.2), one easily deduces the following corollary:

Corollary 8.5. Let $f([z]) = \int_\Omega X(x)\,[(\pi_{\mathcal{P}_n} \circ \Phi_U)([z])](x)\,dx$ be a Kähler function on $\mathbb{P}(\mathbb{C}^n)$. Then a real number $\lambda$ belongs to $\mathrm{Im}(X)$ if and only if $\lambda$ is a critical value of $f$, i.e. if and only if

\[ \exists\, [z] \in \mathbb{P}(\mathbb{C}^n) \ \text{ such that } \ f_{*_{[z]}} = 0 \ \text{ and } \ f([z]) = \lambda. \tag{81} \]

The above corollary implies that the set $\mathrm{Im}(X)$ does not depend on the particular decomposition of $f$ given in Proposition 8.1. We can thus give the following definition:

Definition 8.6. The spectrum of a Kähler function $f([z]) = \int_\Omega X(x)\,[(\pi_{\mathcal{P}_n} \circ \Phi_U)([z])](x)\,dx$ on $\mathbb{P}(\mathbb{C}^n)$ is the subset of $\mathbb{R}$ given by

\[ \mathrm{spec}(f) := \mathrm{Im}(X). \tag{82} \]

This is the set of all critical values of $f$.

Following [AS99], we shall call elements of $\mathrm{spec}(f)$ eigenvalues and the corresponding critical points eigenpoints. These are the geometrical analogues, in the geometrical formulation, of the usual eigenvalues and eigenvectors of Hermitian matrices used in the standard formulation of quantum mechanics.

Given a Kähler function $f([z]) = \int_\Omega X(x)\,[(\pi_{\mathcal{P}_n} \circ \Phi_U)([z])](x)\,dx$ and a point $[z] \in \mathbb{P}(\mathbb{C}^n)$, there is an obvious associated probability $P_{f,[z]}$ on $\mathrm{spec}(f)$:

\[ P_{f,[z]}(A) := \int_{X^{-1}(A)} \big[(\pi_{\mathcal{P}_n} \circ \Phi_U)([z])\big](x)\,dx, \tag{83} \]

where $A \subseteq \mathrm{spec}(f)$ is a given subset. This is the pushforward of $[(\pi_{\mathcal{P}_n} \circ \Phi_U)([z])](x)\,dx$ via the random variable $X : \Omega \to \mathbb{R}$.

As for $\mathrm{spec}(f)$, we would like to show that this probability does not depend on the particular decomposition of $f$ given in Proposition 8.1. To this end, we introduce, for a given real number $\lambda$, the following space:

\[ M_{f,\lambda} := \{[z] \in \mathbb{P}(\mathbb{C}^n) \mid f_{*_{[z]}} = 0 \ \text{ and } \ f([z]) = \lambda\}. \tag{84} \]

Observe that if $\lambda \notin \mathrm{spec}(f)$, then $M_{f,\lambda} = \emptyset$.

14 The absence of the factor 1/4 in (79) compared to (73) is due to the normalizing factor of the Fubini-Study metric $g_{FS}$ used throughout this section (see also Proposition 7.2).
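Lemma 8.7. Let $\lambda \in \mathrm{spec}(f)$ be an eigenvalue of a given Kähler function $f([z]) = \int_\Omega X(x)\,[(\pi_{\mathcal{P}_n} \circ \Phi_U)([z])](x)\,dx$, and let us fix, for notational convenience, some indices $k_1, \ldots, k_{m(\lambda)} \in \{1, \ldots, n\}$ so that we can write $X^{-1}(\lambda) = \{x_{k_1}, \ldots, x_{k_{m(\lambda)}}\}$. Let us also denote by $\{e_1, \ldots, e_n\}$ the canonical basis for $\mathbb{C}^n$. Then,

\[ M_{f,\lambda} = \big\{[U^* \cdot (c_1 \cdot e_{k_1} + \cdots + c_{m(\lambda)} \cdot e_{k_{m(\lambda)}})] \in \mathbb{P}(\mathbb{C}^n) \;\big|\; c_1, \ldots, c_{m(\lambda)} \in \mathbb{C}\big\}. \tag{85} \]

In particular, $M_{f,\lambda} \cong \mathbb{P}(\mathbb{C}^{m(\lambda)})$.

Proof. Let $[z]$ be an element of $\mathbb{P}(\mathbb{C}^n)$. Writing $q := (\pi_{\mathcal{P}_n} \circ \Phi_U)([z])$ for brevity, we have:

\begin{align*}
[z] \in M_{f,\lambda} \quad&\Leftrightarrow\quad f_{*_{[z]}} = 0 \ \text{ and } \ f([z]) = \lambda \\
&\Leftrightarrow\quad V_q(X) = 0 \ \text{ and } \ E_q(X) = \lambda \\
&\Leftrightarrow\quad \sum_{k=1}^n (X(x_k) - \lambda)^2\, q(x_k) = 0 \ \text{ and } \ E_q(X) = \lambda \\
&\Leftrightarrow\quad (X(x_k) - \lambda)^2\, q(x_k) = 0 \ \text{ for all } k \in \{1, \ldots, n\} \ \text{ and } \ E_q(X) = \lambda \\
&\Leftrightarrow\quad q(x_k) = 0 \ \text{ for all } k \in \{1, \ldots, n\} - \{k_1, \ldots, k_{m(\lambda)}\} \ \text{ and } \ E_q(X) = \lambda \\
&\Leftrightarrow\quad q(x_k) = 0 \ \text{ for all } k \in \{1, \ldots, n\} - \{k_1, \ldots, k_{m(\lambda)}\}. \tag{86}
\end{align*}

Now observe that for $k \in \{1, \ldots, n\}$,

\[ (\pi_{\mathcal{P}_n} \circ \Phi_U)([z])(x_k) = \frac{|\langle Uz, e_k \rangle|^2}{\langle z, z \rangle} = \frac{|\langle z, U^* e_k \rangle|^2}{\langle z, z \rangle}, \tag{87} \]

and thus,

\[ [z] \in M_{f,\lambda} \quad\Leftrightarrow\quad \langle z, U^* e_k \rangle = 0 \ \text{ for all } k \in \{1, \ldots, n\} - \{k_1, \ldots, k_{m(\lambda)}\}, \tag{88} \]

from which the lemma follows. □

From Lemma 8.7, we see that the cardinality of $X^{-1}(\lambda)$ does not depend on the decomposition of $f$; we shall call this number the multiplicity of the eigenvalue $\lambda$ and denote it by $m(\lambda)$. As we already saw, $M_{f,\lambda} \cong \mathbb{P}(\mathbb{C}^{m(\lambda)})$. In [AS99], $M_{f,\lambda}$ is called the eigenmanifold of $f$ associated to $\lambda$.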

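Now recall that the geodesic distance $d(\,,)$ on $\mathbb{P}(\mathbb{C}^n)$ induced by the Fubini-Study metric $g_{FS}$ is given, for $[z], [w] \in \mathbb{P}(\mathbb{C}^n)$, by:

\[ d([z], [w]) = \cos^{-1}\Big(\frac{|\langle z, w \rangle|}{\|z\|\,\|w\|}\Big). \tag{89} \]

The above formula together with the Cauchy-Schwarz inequality readily implies the following lemma:

Lemma 8.8. Let $\lambda \in \mathrm{spec}(f)$ be an eigenvalue and let $[z]$ be a point in $\mathbb{P}(\mathbb{C}^n)$. Then there exists a unique point $P_{M_{f,\lambda}}([z])$ in $M_{f,\lambda}$ verifying

\[ d\big(P_{M_{f,\lambda}}([z]), [z]\big) < d([w], [z]) \tag{90} \]

for all $[w] \in M_{f,\lambda}$ such that $[w] \neq P_{M_{f,\lambda}}([z])$. Moreover, if $X^{-1}(\lambda) = \{x_{k_1}, \ldots, x_{k_{m(\lambda)}}\}$, then

\[ P_{M_{f,\lambda}}([z]) = \Big[\sum_{l=1}^{m(\lambda)} \langle U^* e_{k_l}, z \rangle \cdot U^* e_{k_l}\Big]. \tag{91} \]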
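Clearly, if $[z] \in \mathbb{P}(\mathbb{C}^n)$, then $d([z], P_{M_{f,\lambda}}([z]))$ is the geodesic distance between $[z]$ and the subset $M_{f,\lambda}$; we shall write $d([z], M_{f,\lambda}) := d([z], P_{M_{f,\lambda}}([z]))$.

Proposition 8.9. Let $f([z]) = \int_\Omega X(x)\,[(\pi_{\mathcal{P}_n} \circ \Phi_U)([z])](x)\,dx$ be a Kähler function, and let $\lambda \in \mathrm{spec}(f)$ be an eigenvalue. For $[z] \in \mathbb{P}(\mathbb{C}^n)$, we have:

\[ \int_{X^{-1}(\lambda)} \big[(\pi_{\mathcal{P}_n} \circ \Phi_U)([z])\big](x)\,dx = \cos^2\big(d([z], M_{f,\lambda})\big). \tag{92} \]

Proof. Let $z \in \mathbb{C}^n$ be a normalized vector, and assume that $X^{-1}(\lambda) = \{x_{k_1}, \ldots, x_{k_{m(\lambda)}}\}$. If $z_\lambda := \sum_{l=1}^{m(\lambda)} \langle U^* e_{k_l}, z \rangle \cdot U^* e_{k_l}$, then clearly $\langle z_\lambda, z \rangle = \langle z_\lambda, z_\lambda \rangle$, and according to Lemma 8.8, $[z_\lambda] = P_{M_{f,\lambda}}([z])$ and $d([z], [z_\lambda]) = d([z], M_{f,\lambda})$. Hence,

\begin{align*}
\int_{X^{-1}(\lambda)} \big[(\pi_{\mathcal{P}_n} \circ \Phi_U)([z])\big](x)\,dx &= \sum_{l=1}^{m(\lambda)} (\pi_{\mathcal{P}_n} \circ \Phi_U)([z])(x_{k_l}) = \sum_{l=1}^{m(\lambda)} |\langle Uz, e_{k_l} \rangle|^2 \\
&= \sum_{l=1}^{m(\lambda)} |\langle z, U^* e_{k_l} \rangle|^2 = |\langle z, z_\lambda \rangle| = \Big(\frac{|\langle z, z_\lambda \rangle|}{\|z\|\,\|z_\lambda\|}\Big)^2 = \cos^2\big(d([z], P_{M_{f,\lambda}}([z]))\big), \tag{93}
\end{align*}

which is exactly (92). The proposition follows. □

A direct consequence of Proposition 8.9 is that the measure $P_{f,[z]}$ on $\mathrm{spec}(f)$ defined in (83) does not depend on a particular decomposition of $f$ such as in Proposition 8.1.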
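Identity (92) lends itself to a quick numerical check. The sketch below, in plain Python with hypothetical helper names, takes $U$ to be the identity matrix to keep the example short (an assumption of this illustration, not a restriction in the text) and a random variable with a doubly degenerate value. It compares the probability of the eigenvalue with $\cos^2$ of the distance (89) to the point (91).

```python
import math

def inner(z, w):
    """Hermitian product on C^n, linear in the second argument."""
    return sum(a.conjugate() * b for a, b in zip(z, w))

def dist(z, w):
    """Geodesic distance (89): arccos(|<z, w>| / (|z| |w|))."""
    c = abs(inner(z, w)) / math.sqrt(inner(z, z).real * inner(w, w).real)
    return math.acos(min(1.0, c))   # clamp against rounding slightly above 1

def eigen_probability(X, z, lam):
    """Left-hand side of (92) with U = identity: mass of pi([z]) on X^{-1}(lam)."""
    norm2 = inner(z, z).real
    return sum(abs(zk) ** 2 for xk, zk in zip(X, z) if xk == lam) / norm2

def project(X, z, lam):
    """Representative z_lam of the closest point (91) with U = identity."""
    return [zk if xk == lam else 0j for xk, zk in zip(X, z)]

X = [2.0, 2.0, 5.0]                      # the value 2.0 has multiplicity 2
z = [0.5 + 0.5j, -1.0 + 0.0j, 0.3 - 0.7j]
lam = 2.0

z_lam = project(X, z, lam)
prob = eigen_probability(X, z, lam)
closest = dist(z, z_lam)
```

With these values `prob` equals `math.cos(closest) ** 2` up to rounding, and any other point of the eigenmanifold, such as `[1, 0, 0]`, lies at a distance from `z` that is at least `closest`.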



With the above proposition, we have completed our "statistical" study of the geometrical formulation 
of quantum mechanics in finite dimension. 

The important points are : the configuration space P(C n ) is the (formal) Kahlerification of V n ; ob- 
servables are Kahler functions / : P(C") — > R that can be decomposed as /([z]) = J n X(x)[(n-p n ° 
§u)([z\)](x)dx , where X £ A v x and where $y is a holomorphic isometry of P(C) ; the spectrum of a 
Kahler function is Im(A) and its associated probability is ■?/,[«] (A) = J x -i, x) X(x)\(ir-p n ° ^u)([z])](x)dx . 

As we see, these quantities depends only on the exponential structure of Tffi 15 !. Thus, we can try to 
generalize them to a given exponential family, as follows. 

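Let $\mathcal{E}$ be an exponential family defined on a measured space $(\Omega, dx)$ with elements of the form
$p(x;\theta) = \exp\{C(x) + \sum_{i=1}^n \theta_i F_i(x) - \psi(\theta)\}$, let $\mathcal{A}_{\mathcal{E}} := \mathrm{Vect}_{\mathbb{R}}\{1, F_1, ..., F_n\}$, and let $\pi_{\mathcal{E}} : \mathcal{E}^{\mathbb{C}} \to \mathcal{E}$ be its
associated Kählerification.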

Regarding £ as the underlying statistical model of a "generalized quantum system" , we are led to the 
following definitions : 

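• Configuration space : $\mathcal{E}^{\mathbb{C}}$, viewed as a Kähler manifold,

• Observables : this is the set of functions $f : \mathcal{E}^{\mathbb{C}} \to \mathbb{R}$ of the form

$\mathcal{E}^{\mathbb{C}} \to \mathbb{R}, \quad z \mapsto \int_\Omega X(x)\,[(\pi_{\mathcal{E}} \circ \Phi)(z)](x)\,dx$, (94)

where $X \in \mathcal{A}_{\mathcal{E}}$ and where $\Phi$ is a holomorphic isometry of $\mathcal{E}^{\mathbb{C}}$. Such a function is necessarily a Kähler
function according to Proposition 5.8.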



15 Although physically clear, this statement still has to be clarified on mathematical grounds, since one has to make
precise the passage from $P(\mathbb{C}^n)^\times$ to its natural "completion" $P(\mathbb{C}^n)$. In this paper, we shall not treat this technical question,
preferring to focus on the general procedure and the physical applications.
• Dynamics : it is given by the flow of the Hamiltonian vector field $X_H$ associated to a given observable
$H : \mathcal{E}^{\mathbb{C}} \to \mathbb{R}$ with respect to the natural symplectic form of the Kähler manifold $\mathcal{E}^{\mathbb{C}}$,

• Spectrum of an observable : the spectrum of an observable $f$ as in (94) is given by the image of the
random variable $X$,

$\mathrm{spec}(f) := \mathrm{Im}(X)$, (95)

• Probabilities associated to an observable : the probability that an observable $f$ as in (94) yields
upon measurement an eigenvalue belonging to a subset $A \subseteq \mathrm{spec}(f)$ while the system is in the state
$z \in \mathcal{E}^{\mathbb{C}}$ is :

$P_{f,z}(A) := \int_{X^{-1}(A)} [(\pi_{\mathcal{E}} \circ \Phi)(z)](x)\,dx$. (96)

Remark 8.10. Usually, the decomposition of a Kähler function $f$ as in (94) is not unique, and thus
$\mathrm{spec}(f)$ and $P_{f,z}$ are only well defined when invariance properties are met.

Of course, and from a physical point of view, the above definitions cannot be taken too literally. For
example when $\mathcal{E} = \mathcal{P}_n^\times$, then it is not $(\mathcal{P}_n^\times)^{\mathbb{C}} = P(\mathbb{C}^n)^\times \cong (\mathbb{C}^*)^{n-1}$ which is interesting, rather its
"completion" $P(\mathbb{C}^n)$, the formal Kählerification of $\mathcal{P}_n$. Another similar example is the space of binomial
distributions $B(n,q)$ considered in §9. As we will see, its Kählerification is an open dense subset of the
two dimensional sphere $S^2$, and, as for $\mathcal{P}_n^\times$, it needs to be completed in order to recover a satisfactory
description of the spin.

Despite these technical difficulties and ambiguities, we shall use the above definitions as a basis for our 
physical investigations, and adapt them in an obvious way when a natural "completion" exists. As we will 
see, this already leads to interesting physical results. 

9 Binomial distribution and the spin of a particle 

Let $B(n,q)$ be the space of binomial distributions defined over $\Omega := \{0, ..., n\}$. By definition, an element $p \in B(n,q)$
is characterized by a real parameter $q \in\ ]0,1[$ verifying, for $k \in \Omega$, $p(k) = \binom{n}{k} q^k (1-q)^{n-k}$, where
$\binom{n}{k} = \frac{n!}{(n-k)!\,k!}$. The set of binomial distributions forms a 1-dimensional statistical manifold parameterized
by $q$ and is easily seen to be an exponential family, for one may write

$p(k) = \binom{n}{k} q^k (1-q)^{n-k} = \exp\big\{\ln\binom{n}{k} + k\theta - n\ln(1+\exp\theta)\big\}$, (97)

where $\theta := \ln\big(\frac{q}{1-q}\big)$. In particular, setting $C(k) := \ln\binom{n}{k}$, $F(k) := k$ and $\psi(\theta) := n\ln(1+\exp\theta)$, one has
$p(k) = \exp\{C(k) + \theta \cdot F(k) - \psi(\theta)\}$.
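
As a quick numerical illustration of (97), the following minimal sketch compares the usual binomial formula with its exponential-family form. The function names are hypothetical and chosen only for this example.

```python
from math import comb, exp, log

def binomial_pmf(n, q, k):
    """Usual form: binom(n, k) * q^k * (1 - q)^(n - k)."""
    return comb(n, k) * q**k * (1 - q)**(n - k)

def natural_parameter(q):
    """theta = ln(q / (1 - q)), defined for 0 < q < 1."""
    return log(q / (1 - q))

def exp_family_pmf(n, theta, k):
    """Exponential-family form exp{C(k) + theta * F(k) - psi(theta)}."""
    C = log(comb(n, k))            # C(k) = ln binom(n, k)
    F = k                          # F(k) = k
    psi = n * log(1 + exp(theta))  # psi(theta) = n ln(1 + exp(theta))
    return exp(C + theta * F - psi)
```

For any $q \in\ ]0,1[$, the two functions should agree up to rounding errors when `theta = natural_parameter(q)`.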

Let $S^2 := \{(x,y,z) \in \mathbb{R}^3 \mid x^2 + y^2 + z^2 = 1\}$ be the unit sphere endowed with its natural Kähler
structure $(g_{S^2}, J_{S^2}, \omega_{S^2})$ and let us write $(S^2)^\times := S^2 - \{(1,0,0), (-1,0,0)\}$.

Proposition 9.1. If $S^2$ is endowed with the Kähler structure $(n \cdot g_{S^2}, J_{S^2}, n \cdot \omega_{S^2})$ (i.e. its natural Kähler
structure is multiplied by $n$), then

$B(n,q)^{\mathbb{C}} \cong (S^2)^\times$, (98)

and in terms of this identification, the map $\pi_{B(n,q)} : B(n,q)^{\mathbb{C}} \to B(n,q)$ becomes

$\pi_{B(n,q)} : (S^2)^\times \to B(n,q), \quad \pi_{B(n,q)}(x,y,z)(k) = \frac{1}{2^n}\binom{n}{k}(1+x)^k (1-x)^{n-k}$. (99)



Proposition 9.1 follows from direct computations. Indeed, in terms of the natural parameter $\theta \in \mathbb{R}$, the
Fisher metric hp on B(n, q) is (see $32]) 

$h_F(\theta) = \frac{n \exp(\theta)}{(1+\exp(\theta))^2}$, (100)

from which one easily sees that the canonical structure (<?, J, w) of TB(n,q) is, using the identification 
TB(n, q)=WL 2 , 9de ^ (0, 0) as well as dTSJ), 

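$g_{(\theta,\dot\theta)} = \frac{n \exp\theta}{(1+\exp\theta)^2}\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad J_{(\theta,\dot\theta)} = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}, \quad \omega_{(\theta,\dot\theta)} = \frac{n \exp\theta}{(1+\exp\theta)^2}\begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$. (101)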

A basis for $\mathscr{K}(TB(n,q))$ is easily seen to be

$1, \quad \tanh(\theta/2), \quad \frac{\cos(\dot\theta/2)}{\cosh(\theta/2)}, \quad \frac{\sin(\dot\theta/2)}{\cosh(\theta/2)}$, (102)

where $\tanh(x) = \frac{\exp(x) - \exp(-x)}{\exp(x) + \exp(-x)}$ and $\cosh(x) = \frac{\exp(x) + \exp(-x)}{2}$.

As a Lie algebra, the space $\mathscr{K}(TB(n,q))$, endowed with the natural Poisson bracket associated to $\omega$,
is isomorphic to the Lie algebra $\mathfrak{u}(2)$ of the group of unitary matrices $U(2)$ via the isomorphism

cos(0/2) 1 /0 -A sin(0/2) 1 /0 z N . 

^ ^T i n ' „„„u 7Z 7^ ^ 7TT I »• n • ^ iUd i 



cosh (0/2) 271V 1 0/' cosh (0/2) 2n V* °y 

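Clearly (see (42)), the group $\Gamma(B(n,q))$ is isomorphic to $\mathbb{Z}$, its natural action on $TB(n,q)$ being
$k \cdot (\theta, \dot\theta) = (\theta, \dot\theta + 4k\pi)$, which is obviously free and proper, and the quotient $TB(n,q)/\Gamma(B(n,q))$ is
diffeomorphic to $(S^2)^\times$ via the map

$TB(n,q)/\Gamma(B(n,q)) \to (S^2)^\times, \quad [(\theta,\dot\theta)] \mapsto \Big(\tanh(\theta/2),\ \frac{\cos(\dot\theta/2)}{\cosh(\theta/2)},\ \frac{\sin(\dot\theta/2)}{\cosh(\theta/2)}\Big)$, (104)

where $[(\theta,\dot\theta)] := \mathbb{Z} \cdot (\theta,\dot\theta) = \{(\theta, \dot\theta + 4k\pi) \in \mathbb{R}^2 \mid k \in \mathbb{Z}\}$.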

A direct calculation shows that if the canonical Kähler structure of $(S^2)^\times$ is weighted by $n$, then (104)
defines a map which is an isomorphism of Kähler manifolds, whence Proposition 9.1.

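The canonical projection $\pi_{B(n,q)} : (S^2)^\times \cong B(n,q)^{\mathbb{C}} \to B(n,q)$ can be naturally extended to the whole
sphere $S^2$ provided we adjoin two elements to $B(n,q)$, namely the Dirac measures $\delta_0$ and $\delta_n$ defined, for
$k \in \{0, ..., n\}$, by

$\delta_0(k) = \begin{cases} 1 & \text{if } k = 0 \\ 0 & \text{otherwise} \end{cases}, \qquad \delta_n(k) = \begin{cases} 1 & \text{if } k = n \\ 0 & \text{otherwise} \end{cases}$. (105)

Let us denote $\overline{B(n,q)} := B(n,q) \cup \{\delta_0, \delta_n\}$ (disjoint union). Clearly, the map $\pi_{B(n,q)} : (S^2)^\times \to B(n,q)$
extends uniquely as a continuous map $\pi_{\overline{B(n,q)}} : S^2 \to \overline{B(n,q)}$, with $\pi_{\overline{B(n,q)}}(-1,0,0) := \delta_0$, $\pi_{\overline{B(n,q)}}(1,0,0) := \delta_n$,
making the following diagram commutative :

$\begin{array}{ccc} (S^2)^\times & \xrightarrow{\ \pi_{B(n,q)}\ } & B(n,q) \\ {\scriptstyle i}\downarrow & & \downarrow{\scriptstyle j} \\ S^2 & \xrightarrow{\ \pi_{\overline{B(n,q)}}\ } & \overline{B(n,q)} \end{array}$ (106)

($i$, $j$ being inclusions). Observe that $\overline{B(n,q)}$ is naturally a topological space since it is included in $\mathcal{P}_{n+1}$.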

In this situation, one may show an analogue of Proposition 7.10 and so, in the sequel we shall mainly
focus on $\pi_{\overline{B(n,q)}}$ instead of $\pi_{B(n,q)}$, and heuristically we shall write $B(n,q)^{\mathbb{C}} = S^2$. The space $S^2$ will be
thought of as the formal Kählerification of $B(n,q)^{\mathbb{C}}$.

The space of Kähler functions on $S^2$ is easily seen to be $\mathscr{K}(S^2) = \mathrm{Vect}_{\mathbb{R}}\{1, x, y, z\}$. From the
Kählerification point of view however, it is more natural to give a description of $\mathscr{K}(S^2)$ similar to that
of Proposition 8.1. To this end, observe that the group of holomorphic isometries of $S^2$ is $SO(3)$ and
that the space of functions $\mathcal{A}_{B(n,q)}$ associated to $B(n,q)$ (see (55) for the general definition of $\mathcal{A}_{\mathcal{E}}$) is
generated by the constant function 1 and the identity function $k : \Omega \to \Omega$, i.e. $X \in \mathcal{A}_{B(n,q)}$ if and only if
$X(k) = \alpha + \beta \cdot k$ for some $\alpha, \beta \in \mathbb{R}$.

Proposition 9.2. Let $f : S^2 \to \mathbb{R}$ be a smooth function. Then $f$ is a Kähler function if and only if there
exist $X \in \mathcal{A}_{B(n,q)}$ and $\phi \in SO(3)$ such that $f$ can be written

$f(x,y,z) = \int_\Omega X(k)\,[(\pi_{\overline{B(n,q)}} \circ \phi)(x,y,z)](k)\,dk, \quad ((x,y,z) \in S^2)$ (107)

where $\pi_{\overline{B(n,q)}} : S^2 \to \overline{B(n,q)}$ is the canonical projection coming from the Kählerification of $B(n,q)$.

Proof. Let $X(k) = \alpha + \beta \cdot k \in \mathcal{A}_{B(n,q)}$ be arbitrary and let $\phi \in SO(3)$ be an isometry such that, using a
matrix representation,

$\phi = \begin{pmatrix} a & b & c \\ * & * & * \\ * & * & * \end{pmatrix}$ (108)

(in particular, the real numbers $a, b, c$ satisfy $\|(a,b,c)\| = 1$, i.e. $a^2 + b^2 + c^2 = 1$).
A simple calculation shows that for $(x,y,z) \in S^2$,

$\int_\Omega X(k)\,[(\pi_{\overline{B(n,q)}} \circ \phi)(x,y,z)](k)\,dk = \alpha + \frac{\beta n}{2} + \frac{\beta n}{2}(ax + by + cz)$ (109)

which is a Kähler function on $S^2$ since it is a linear combination of $1, x, y, z$.

Reciprocally, if $u_0, u, v, w \in \mathbb{R}$ with $(u,v,w) \neq 0$, then the equation $\int_\Omega X(k)\,[(\pi_{\overline{B(n,q)}} \circ \phi)(x,y,z)](k)\,dk = u_0 + ux + vy + wz$,
with unknowns $\alpha, \beta, a, b, c$, admits as a solution

$\alpha = u_0 \pm \|(u,v,w)\|, \quad \beta = \mp\frac{2}{n}\|(u,v,w)\|, \quad (a,b,c) = \mp\frac{1}{\|(u,v,w)\|}(u,v,w)$, (110)

where "$\pm = -$" if $\beta > 0$ and "$\pm = +$" if $\beta < 0$, and where $\|\cdot\|$ is the Euclidean norm. If $(u,v,w) = 0$,
then a solution is given by $\alpha = u_0$ and $\beta = 0$ ($\phi$ being arbitrary). The proposition follows. □

Following our discussion at the end of §8, we want to define the spectrum $\mathrm{spec}(f)$ of a Kähler function
$f(x,y,z) = \int_\Omega X(k)\,[(\pi_{\overline{B(n,q)}} \circ \phi)(x,y,z)](k)\,dk$ on $S^2$ as $\mathrm{Im}(X)$, and its associated probability on $\mathrm{spec}(f)$
as $P_{f,(x,y,z)}(A) = \int_{X^{-1}(A)} [(\pi_{\overline{B(n,q)}} \circ \phi)(x,y,z)](k)\,dk$. For this to be consistent, we need to check that
these formulas are independent of the decomposition of $f$ given in Proposition 9.2.

Proposition 9.3. Let $f(x,y,z) = u_0 + ux + vy + wz$ be a Kähler function on $S^2$. Then the spectrum
$\mathrm{spec}(f)$ and the probability $P_{f,(x,y,z)}$ are well defined, and we have $\mathrm{spec}(f) = \{\lambda_0, ..., \lambda_n\}$, where

$\lambda_k = u_0 + \frac{2}{n}\|(u,v,w)\| \cdot \Big(-\frac{n}{2} + k\Big)$ (111)

($k = 0, ..., n$; $\|\cdot\|$ Euclidean norm), and if $(u,v,w) \neq 0$, then

$P_{f,(x,y,z)}(\lambda_k) = \frac{1}{2^n}\binom{n}{k}\Big(1 + \frac{ux+vy+wz}{\|(u,v,w)\|}\Big)^k \Big(1 - \frac{ux+vy+wz}{\|(u,v,w)\|}\Big)^{n-k}$. (112)

Corollary 9.4. Let $(u,v,w) \in \mathbb{R}^3$ be a vector whose Euclidean norm is $j := n/2$, and let $f : S^2 \to \mathbb{R}$ be
the Kähler function defined by $f(x,y,z) := ux + vy + wz$. Then,

• $\mathrm{spec}(f) = \{-j, -j+1, ..., j-1, j\}$,

• $P_{f,(x,y,z)}(-j+k) = \binom{n}{k}\big(\cos^2(\theta/2)\big)^k \big(\sin^2(\theta/2)\big)^{n-k}$, (113)

where $\theta$ is an angle satisfying $\frac{ux+vy+wz}{\|(u,v,w)\|} = \cos(\theta)$.

Remark 9.5. As mentioned in the introduction, (113) gives the probability that a spin-$j$ particle entering
a second Stern-Gerlach device with maximum spin state (see Footnote^ is deflected into the (—j + k)-th 
outgoing beam, where 9 is the angle between the two magnetic fields produced by the two Stern-Gerlach 
devices (see for example [Mar02]). We will see subsequently how to obtain the probabilities corresponding
to an incoming particle when the eigenvalue of its spin operator along the magnetic field of the first Stern- 
Gerlach device is arbitrary. 
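
The spectrum and probabilities of Proposition 9.3 are easy to evaluate numerically. The sketch below is only an illustration with hypothetical function names: it computes $\lambda_k$ and the binomial weights at a point of the sphere for $(u,v,w) \neq 0$, so that one can check that the weights sum to 1 and that the mean value $\sum_k \lambda_k P(\lambda_k)$ equals $f(x,y,z)$.

```python
from math import comb, sqrt

def spectrum(n, u0, u, v, w):
    """Eigenvalues lambda_k = u0 + (2/n) * ||(u,v,w)|| * (-n/2 + k), k = 0..n."""
    r = sqrt(u * u + v * v + w * w)
    return [u0 + (2.0 / n) * r * (-n / 2.0 + k) for k in range(n + 1)]

def probabilities(n, u, v, w, point):
    """Binomial weights attached to lambda_0..lambda_n at a point (x, y, z) of S^2.

    Assumes (u, v, w) != 0; c is the cosine of the angle between (u, v, w)
    and the point on the unit sphere.
    """
    x, y, z = point
    r = sqrt(u * u + v * v + w * w)
    c = (u * x + v * y + w * z) / r
    return [comb(n, k) * (1 + c) ** k * (1 - c) ** (n - k) / 2 ** n
            for k in range(n + 1)]
```

With $\|(u,v,w)\| = n/2 = j$ and $u_0 = 0$, `spectrum` returns $-j, ..., j$, as in Corollary 9.4.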

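Proof of Proposition 9.3. Let $f(x,y,z) = \int_\Omega X(k)\,[(\pi_{\overline{B(n,q)}} \circ \phi)(x,y,z)](k)\,dk$ be a Kähler function on $S^2$
with $X(k) = \alpha + \beta \cdot k$ ($\alpha, \beta \in \mathbb{R}$) and $\phi$ having a matrix representation as in (108). We have to show that
$\mathrm{spec}(f) := \mathrm{Im}(\alpha + \beta \cdot k) = \{\alpha, \alpha + \beta, ..., \alpha + \beta \cdot n\}$ is independent of the decomposition of $f$. For this, we
need to check that if $f$ can be written $f(x,y,z) = \sum_{k=0}^n (\bar\alpha + \bar\beta \cdot k)\,[(\pi_{\overline{B(n,q)}} \circ \bar\phi)(x,y,z)](k)$ with different
$\bar\alpha, \bar\beta \in \mathbb{R}$ and a different $\bar\phi \in SO(3)$ (with different $\bar a, \bar b, \bar c \in \mathbb{R}$), then $\mathrm{Im}(\alpha + \beta \cdot k) = \mathrm{Im}(\bar\alpha + \bar\beta \cdot k)$. To this
end, observe that if $\sum_{k=0}^n (\alpha + \beta \cdot k)\,[(\pi_{\overline{B(n,q)}} \circ \phi)(x,y,z)](k) = \sum_{k=0}^n (\bar\alpha + \bar\beta \cdot k)\,[(\pi_{\overline{B(n,q)}} \circ \bar\phi)(x,y,z)](k)$
for all $(x,y,z) \in S^2$, then

$\alpha + \frac{\beta n}{2} = \bar\alpha + \frac{\bar\beta n}{2}$ and $\frac{\beta n}{2} \cdot (a,b,c) = \frac{\bar\beta n}{2} \cdot (\bar a, \bar b, \bar c)$. (114)

Taking into account the fact that $\|(a,b,c)\| = \|(\bar a, \bar b, \bar c)\| = 1$, one immediately sees that $|\beta| = |\bar\beta|$, and we
are led to the following three possibilities:

$\beta = \bar\beta = 0,\ \alpha = \bar\alpha$ or $\beta \neq 0,\ \beta = \bar\beta,\ \alpha = \bar\alpha,\ (a,b,c) = (\bar a, \bar b, \bar c)$ (115)

or $\beta \neq 0,\ \beta = -\bar\beta,\ \alpha = \bar\alpha - \beta \cdot n,\ (a,b,c) = -(\bar a, \bar b, \bar c)$. (116)

The only ambiguity is for the last case for which we have :

$\mathrm{Im}(\bar\alpha + \bar\beta \cdot k) = \mathrm{Im}(\alpha + \beta \cdot n - \beta \cdot k) = \{\alpha + \beta \cdot n - \beta \cdot k \mid k = 0, ..., n\}$
$= \{\alpha + \beta \cdot (n-k) \mid k = 0, ..., n\} = \{\alpha + \beta \cdot k \mid k = 0, ..., n\}$
$= \mathrm{Im}(\alpha + \beta \cdot k)$. (117)

Hence $\mathrm{Im}(\alpha + \beta \cdot k) = \mathrm{Im}(\bar\alpha + \bar\beta \cdot k)$. It follows that $\mathrm{spec}(f)$ is well defined.

In terms of $u_0, u, v, w$, and assuming $\beta > 0$ for simplicity (the case $\beta < 0$ leads to the same result), we
have, using (110),

$\mathrm{spec}(f) = \mathrm{Im}(\alpha + \beta \cdot k) = \{\alpha + k \cdot \beta \mid k = 0, ..., n\}$
$= \big\{u_0 - \|(u,v,w)\| + k \cdot \frac{2}{n}\|(u,v,w)\| \ \big|\ k = 0, ..., n\big\}$
$= \big\{u_0 + \frac{2}{n}\|(u,v,w)\| \cdot \big(-\frac{n}{2} + k\big) \ \big|\ k = 0, ..., n\big\}$. (118)

This is exactly the first item in (111).

Now, by similar arguments, one shows that $P_{f,(x,y,z)}$ is indeed well defined and yields a probability on
$\mathrm{spec}(f)$. The proposition follows. □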

As we already mentioned, not all the possibilities in the Stern-Gerlach experiment are exhausted with
(113). But of course, we would like to recover all these probabilities following our "statistical approach".
This may be done as follows.

Let $j : B(n,q) \hookrightarrow \mathcal{P}_{n+1}^\times$ be the canonical inclusion. The composition of its derivative $j_* : TB(n,q) \hookrightarrow T\mathcal{P}_{n+1}^\times$
with the map $r : T\mathcal{P}_{n+1}^\times \to P(\mathbb{C}^{n+1})^\times$ considered in §7 yields a map $TB(n,q) \to P(\mathbb{C}^{n+1})$ that
we would like to describe. To this end, recall that the elements of $B(n,q)$ can be parameterized by the
natural parameter $\theta \in \mathbb{R}$ as $p(k;\theta) = \exp\big\{\ln\binom{n}{k} + k\theta - n\ln(1+\exp(\theta))\big\}$ (see (57)), and that $TB(n,q)$
is identified with $\mathbb{R}^2$ via the map $\dot\theta\,\partial_\theta \mapsto (\theta, \dot\theta)$.



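Lemma 9.6. In terms of the natural parameter $\theta$, the map $r \circ j_* : TB(n,q) \to P(\mathbb{C}^{n+1})$ reads

$(r \circ j_*)(\theta, \dot\theta) = \big[\,p(0;\theta)^{1/2},\ p(1;\theta)^{1/2} e^{i\dot\theta/2},\ ...,\ p(n;\theta)^{1/2} e^{i\dot\theta n/2}\,\big]$. (119)

Proof. Take a smooth curve $\theta(t)$ in $\mathbb{R}$ and set $\theta := \theta(0)$ and $\dot\theta := \frac{d}{dt}\big|_0 \theta(t)$. For $k \in \{0, ..., n\}$, we have

$\frac{d}{dt}\Big|_0 p(k;\theta(t)) = \frac{d}{dt}\Big|_0 \exp\Big\{\ln\binom{n}{k} + k\theta(t) - n\ln\big(1+\exp(\theta(t))\big)\Big\}$
$= \frac{d}{dt}\Big|_0 \Big[\ln\binom{n}{k} + k\theta(t) - n\ln\big(1+\exp(\theta(t))\big)\Big]\, p(k,\theta)$
$= \dot\theta\Big[k - n\frac{\exp(\theta)}{1+\exp(\theta)}\Big]\, p(k,\theta)$, (120)

from which we see that $j_*(\theta,\dot\theta)$ corresponds, in the exponential representation of $T\mathcal{P}_{n+1}^\times$ (see §7), to the
vector $[u(\dot\theta)]_{p(\cdot\,;\theta)}$, where $u(\dot\theta) \in \mathbb{R}^{n+1}$ is defined, for $k \in \{0, ..., n\}$, by

$u(\dot\theta)_k = \dot\theta\Big(k - n\frac{\exp(\theta)}{1+\exp(\theta)}\Big)$. (121)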


The lemma is now a simple consequence of (121) together with the definition of $r$ (see (57)) and the
homogeneity of the homogeneous coordinates of the complex projective space $P(\mathbb{C}^{n+1})$. □

Recall that the group $\Gamma(B(n,q))$ is isomorphic to $\mathbb{Z}$ and that its action on $TB(n,q)$ is given by $k \cdot (\theta,\dot\theta) =
(\theta, \dot\theta + 4k\pi)$. Clearly, $r \circ j_*$ is $\mathbb{Z}$-invariant, and since $TB(n,q)/\Gamma(B(n,q)) \cong (S^2)^\times$ (see (104)), we get a map
$(S^2)^\times \to P(\mathbb{C}^{n+1})$ which can be conveniently described by the following parametrization of the sphere,

$x = \cos(\alpha), \quad y = \sin(\alpha)\cos(\beta), \quad z = \sin(\alpha)\sin(\beta)$, (122)

where $\alpha \in [0,\pi]$, $\beta \in [0,2\pi]$. With these parameters, the map $(S^2)^\times \to P(\mathbb{C}^{n+1})$ reads

$(\alpha,\beta) \mapsto [\Psi(\alpha,\beta)]$, (123)

where $\Psi(\alpha,\beta)$ is the vector in $\mathbb{C}^{n+1}$ whose $k$th component is ($k = 0, ..., n$) :

$\Psi(\alpha,\beta)_k = \binom{n}{k}^{1/2}\big(\cos(\alpha/2)\big)^k \big(\sin(\alpha/2)\big)^{n-k} \cdot e^{i\beta k}$. (124)

Observe that the map $\Psi$ is defined on the whole sphere, i.e. also for $\alpha = 0$ and $\alpha = \pi$ (we agree that
"$0^0 = 1$"), and that $\langle \Psi, \Psi\rangle = 1$, i.e. $\Psi$ is normalized (here $\Psi := \Psi(\alpha,\beta)$).
By construction, if $\mathrm{pr} : \mathbb{C}^{n+1} - \{0\} \to P(\mathbb{C}^{n+1})$ is the canonical projection, then we have the following
commutative diagram,

[Commutative diagram not recoverable from the source; the surviving labels are $S^2$, $\pi_{\overline{B(n,q)}}$, $\overline{B(n,q)}$, $\mathrm{Id}$ and spaces indexed by $n+1$.] (125)

where $j : \overline{B(n,q)} \to \mathcal{P}_{n+1}$ is the canonical injection, and where $r : \mathbb{C}^{n+1} - \{0\} \to \mathcal{P}_{n+1}$ is defined by

$r(z_1, ..., z_{n+1})(k) := \frac{|z_k|^2}{\langle z, z\rangle}$, where $z = (z_1, ..., z_{n+1})$.
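
The relation between the vector $\Psi(\alpha,\beta)$ and the binomial distribution can be checked numerically. The following minimal sketch (hypothetical function names) builds $\Psi(\alpha,\beta)$ from its $k$th component $\binom{n}{k}^{1/2}\cos(\alpha/2)^k\sin(\alpha/2)^{n-k}e^{i\beta k}$ and compares $|\Psi_k|^2$ with the binomial weights $2^{-n}\binom{n}{k}(1+x)^k(1-x)^{n-k}$ at $x = \cos\alpha$.

```python
import cmath
from math import comb, cos, sin, sqrt

def psi_vec(n, alpha, beta):
    """Vector in C^(n+1) with components Psi(alpha, beta)_k, k = 0..n."""
    return [sqrt(comb(n, k)) * cos(alpha / 2) ** k * sin(alpha / 2) ** (n - k)
            * cmath.exp(1j * beta * k) for k in range(n + 1)]

def binomial_on_sphere(n, x):
    """Weights 2^(-n) * binom(n, k) * (1 + x)^k * (1 - x)^(n - k), k = 0..n."""
    return [comb(n, k) * (1 + x) ** k * (1 - x) ** (n - k) / 2 ** n
            for k in range(n + 1)]
```

Python evaluates `0.0 ** 0` as `1.0`, which matches the convention "$0^0 = 1$" used at the poles.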



Regarding $S^2$ as an embedded submanifold of $P(\mathbb{C}^{n+1})$ via the map $\mathrm{pr} \circ \Psi$, we have the following
proposition.

Proposition 9.7. Every Kähler function $f$ on $S^2$ extends uniquely as a Kähler function $\hat f$ on $P(\mathbb{C}^{n+1})$,
and the resulting linear map $\mathscr{K}(S^2) \to \mathscr{K}(P(\mathbb{C}^{n+1}))$, $f \mapsto \hat f$ satisfies

$\widehat{\{f,g\}} = \frac{1}{4}\{\hat f, \hat g\}$ (126)

for all $f, g \in \mathscr{K}(S^2)$.

We will show Proposition 9.7 with a series of Lemmas.



Let $f : S^2 \to \mathbb{R}$ be a Kähler function on the sphere, that is, a linear combination of $1, x, y, z$ :
$f(x,y,z) = u_0 + ux + vy + wz$. According to the characterization of Kähler functions on $P(\mathbb{C}^{n+1})$ given in
Lemma 7.5, the function $f$ possesses a Kähler extension on $P(\mathbb{C}^{n+1})$ if and only if there exists a Hermitian
matrix $Q(f) \in \mathrm{Herm}(\mathbb{C}^{n+1})$ such that, in terms of the parameters $\alpha \in [0,\pi]$ and $\beta \in [0,2\pi]$ introduced in
(122),

$u_0 + u\cos(\alpha) + v\sin(\alpha)\cos(\beta) + w\sin(\alpha)\sin(\beta) = \langle \Psi(\alpha,\beta),\ Q(f) \cdot \Psi(\alpha,\beta)\rangle$ (127)

for all $\alpha \in [0,\pi]$ and all $\beta \in [0,2\pi]$.

Lemma 9.8. There exists a unique Hermitian matrix $Q(f)$ such that (127) holds for all $\alpha \in [0,\pi]$ and
all $\beta \in [0,2\pi]$. It is explicitly given by

$Q(f)_{kk} = u_0 - u \cdot \frac{2}{n}\Big(\frac{n}{2} - k\Big), \quad Q(f)_{l,l+1} = \frac{1}{n}\sqrt{(n-l)(l+1)} \cdot (v - iw), \quad Q(f)_{ab} = 0$, (128)

where $k = 0, ..., n$, $l = 0, ..., n-1$, $a = 0, ..., n-2$ and where $b$ is such that $a + 2 \le b \le n$.

Proof. For $a, b, k \in \{0, 1, ..., n\}$ and $\alpha \in [0,\pi]$, set

$C_{ab}(\alpha) := \binom{n}{a}^{1/2}\binom{n}{b}^{1/2}\big(\cos(\alpha/2)\big)^{a+b}\big(\sin(\alpha/2)\big)^{2n-(a+b)} \cdot Q(f)_{ab}$,

$A_k(\alpha) := \sum_{a=0}^{n-k} C_{a,k+a}(\alpha)$.
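Observe that $C_{ab} = \overline{C_{ba}}$ (complex conjugate). By definition of $\Psi$ (see (124)), we have

$\langle \Psi(\alpha,\beta),\ Q(f) \cdot \Psi(\alpha,\beta)\rangle$

$= \sum_{a,b=0}^{n} \binom{n}{a}^{1/2}\binom{n}{b}^{1/2}\big(\cos(\alpha/2)\big)^{a+b}\big(\sin(\alpha/2)\big)^{2n-(a+b)} e^{i\beta(b-a)}\, Q(f)_{ab}$

$= \sum_{a,b=0}^{n} C_{ab}(\alpha)\, e^{i\beta(b-a)} = \sum_{k=0}^{n}\Big(\sum_{a=0}^{n-k} C_{a,k+a}(\alpha)\Big) e^{i\beta k} + \sum_{k=1}^{n}\Big(\sum_{a=0}^{n-k} C_{k+a,a}(\alpha)\Big) e^{-i\beta k}$

$= \sum_{a=0}^{n} C_{aa}(\alpha) + \sum_{k=1}^{n}\Big(A_k(\alpha)\, e^{ik\beta} + \overline{A_k(\alpha)}\, e^{-ik\beta}\Big)$

$= A_0(\alpha) + 2\sum_{k=1}^{n} \mathrm{Re}(A_k(\alpha)) \cdot \cos(k\beta) - 2\sum_{k=1}^{n} \mathrm{Im}(A_k(\alpha)) \cdot \sin(k\beta)$, (129)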

and thus (127) may be rewritten :

$u_0 + u\cos(\alpha) + v\sin(\alpha)\cos(\beta) + w\sin(\alpha)\sin(\beta)$

$= A_0(\alpha) + 2\sum_{k=1}^{n} \mathrm{Re}(A_k(\alpha)) \cdot \cos(k\beta) - 2\sum_{k=1}^{n} \mathrm{Im}(A_k(\alpha)) \cdot \sin(k\beta)$. (130)

Since the functions $\cos(k\beta)$ and $\sin(k'\beta)$ ($k = 0, ..., n$ and $k' = 1, ..., n$) are linearly independent, we obtain:

$A_0(\alpha) = u_0 + u\cos(\alpha), \quad 2\mathrm{Re}(A_1(\alpha)) = v\sin(\alpha), \quad 2\mathrm{Re}(A_k(\alpha)) = 0$ for all $k \ge 2$,
and $-2\mathrm{Im}(A_1(\alpha)) = w\sin(\alpha), \quad -2\mathrm{Im}(A_k(\alpha)) = 0$ for all $k \ge 2$. (131)

From this set of equations, one sees that $\mathrm{Re}(A_k(\alpha)) = \mathrm{Im}(A_k(\alpha)) = 0$ for all $k \ge 2$, i.e. $A_k(\alpha) = 0$ for all
$k \ge 2$. In view of the definition of $A_k(\alpha)$, we thus have

$\sum_{a=0}^{n-k} \binom{n}{a}^{1/2}\binom{n}{a+k}^{1/2}\big(\cos(\alpha/2)\big)^{2a+k}\big(\sin(\alpha/2)\big)^{2n-(2a+k)}\, Q(f)_{a,k+a} = 0$. (132)

It is not difficult to show that the functions $\cos(\alpha/2)^k \sin(\alpha/2)^{N-k}$ ($k = 0, ..., N$, $N \in \mathbb{N}$) are linearly
independent, and thus for all $k \ge 2$ and for all $a$ such that $0 \le a \le n-k$,

$Q(f)_{a,k+a} = Q(f)_{k+a,a} = 0$. (133)

Hence, except for the three "central diagonals", all entries of $Q(f)$ vanish ; this corresponds to the third
equation in (128).

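Now there are still three equations in (131) we haven't used, namely $A_0(\alpha) = u_0 + u\cos(\alpha)$, $2\mathrm{Re}(A_1(\alpha)) =
v\sin(\alpha)$ and $-2\mathrm{Im}(A_1(\alpha)) = w\sin(\alpha)$. Using the definitions of $A_0(\alpha)$ and $A_1(\alpha)$, these equations read

$\sum_{a=0}^{n} \binom{n}{a}\big(\cos(\alpha/2)\big)^{2a}\big(\sin(\alpha/2)\big)^{2n-2a}\, Q(f)_{aa} = u_0 + u\cos(\alpha)$, (134)

$2\sum_{a=0}^{n-1} \binom{n}{a}^{1/2}\binom{n}{a+1}^{1/2}\big(\cos(\alpha/2)\big)^{2a+1}\big(\sin(\alpha/2)\big)^{2n-(2a+1)}\, Q(f)_{a,a+1} = (v - iw) \cdot \sin(\alpha)$. (135)

Using the identity $\sin(\alpha) = 2\sin(\alpha/2)\cos(\alpha/2)$ as well as

$\binom{n}{a}^{1/2}\binom{n}{a+1}^{1/2} = \frac{n}{\sqrt{(n-a)(a+1)}}\binom{n-1}{a}, \qquad 1 = \sum_{a=0}^{n-1}\binom{n-1}{a}\big(\cos^2(\alpha/2)\big)^{a}\big(\sin^2(\alpha/2)\big)^{n-1-a}$, (136)

one rewrites (135) as

$\sum_{a=0}^{n-1}\binom{n-1}{a}\Big[\frac{n}{\sqrt{(n-a)(a+1)}}\, Q(f)_{a,a+1} - (v - iw)\Big]\big(\cos(\alpha/2)\big)^{2a}\big(\sin(\alpha/2)\big)^{2n-2a-2} = 0$, (137)

from which it follows that $Q(f)_{a,a+1} = \frac{1}{n} \cdot \sqrt{(n-a)(a+1)}\,(v - iw)$ for all $a$ such that $0 \le a \le n-1$.
Finally, from (134) together with the identity

$\sum_{a=0}^{n}\binom{n}{a}\Big(\frac{n}{2} - a\Big)\big(\cos(\alpha/2)\big)^{2a}\big(\sin(\alpha/2)\big)^{2n-2a} = -\frac{n}{2} \cdot \cos(\alpha)$, (138)

one easily obtains the first equation in (128). The lemma follows. □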
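
Lemma 9.8 lends itself to a direct numerical check. The sketch below (hypothetical function names) builds the tridiagonal matrix with diagonal entries $u_0 - u\frac{2}{n}(\frac{n}{2}-k)$ and upper off-diagonal entries $\frac{1}{n}\sqrt{(n-l)(l+1)}(v-iw)$, and evaluates $\langle\Psi, Q(f)\Psi\rangle$. It assumes that the Hermitian product is conjugate-linear in its first argument, which is the convention under which the entries above reproduce $u_0 + u\cos\alpha + v\sin\alpha\cos\beta + w\sin\alpha\sin\beta$.

```python
import cmath
from math import comb, cos, sin, sqrt

def psi_vec(n, alpha, beta):
    """Vector Psi(alpha, beta) in C^(n+1), components indexed by k = 0..n."""
    return [sqrt(comb(n, k)) * cos(alpha / 2) ** k * sin(alpha / 2) ** (n - k)
            * cmath.exp(1j * beta * k) for k in range(n + 1)]

def q_matrix(n, u0, u, v, w):
    """Tridiagonal Hermitian matrix Q(f) for f = u0 + u*x + v*y + w*z."""
    Q = [[0j] * (n + 1) for _ in range(n + 1)]
    for k in range(n + 1):
        Q[k][k] = complex(u0 - u * (2.0 / n) * (n / 2.0 - k))
    for l in range(n):
        Q[l][l + 1] = sqrt((n - l) * (l + 1)) * (v - 1j * w) / n
        Q[l + 1][l] = Q[l][l + 1].conjugate()  # Hermitian symmetry
    return Q

def expectation(Q, psi):
    """<psi, Q psi>, conjugate-linear in the first slot (assumed convention)."""
    m = len(psi)
    return sum(psi[a].conjugate() * Q[a][b] * psi[b]
               for a in range(m) for b in range(m))
```

The value returned by `expectation(q_matrix(n, u0, u, v, w), psi_vec(n, alpha, beta))` should be real up to rounding and agree with $f$ evaluated at the point $(\cos\alpha, \sin\alpha\cos\beta, \sin\alpha\sin\beta)$.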

Lemma 9.9. The map $Q : \mathscr{K}(S^2) \to \mathrm{Herm}(\mathbb{C}^{n+1})$ satisfies

$Q(\{f,g\}) = -\frac{i}{2}\,[Q(f), Q(g)]$ (139)

for all $f, g \in \mathscr{K}(S^2)$.

Proof. By direct computations using (128). □

Lemma 9.10. For $f, g \in \mathscr{K}(S^2)$, $\widehat{\{f,g\}} = \frac{1}{4}\{\hat f, \hat g\}$.

Proof. The lemma is a consequence of Lemma 9.9 together with Lemma 7.6. Indeed, using the Lie algebra
isomorphism $\mathfrak{u}(n+1) \to \mathscr{K}(P(\mathbb{C}^{n+1}))$, $A \mapsto \xi^A$ given in Lemma 7.6 and the fact that $\hat f = \xi^{-2iQ(f)}$, we
see that

$\widehat{\{f,g\}} = \xi^{-2iQ(\{f,g\})} = \xi^{-[Q(f),Q(g)]} = \xi^{[iQ(f),\,iQ(g)]} = \{\xi^{iQ(f)}, \xi^{iQ(g)}\}$

$= \{\xi^{-2iQ(-f/2)}, \xi^{-2iQ(-g/2)}\} = \{\widehat{-f/2}, \widehat{-g/2}\} = \frac{1}{4}\{\hat f, \hat g\}$. (140)

The lemma follows. □

Proposition 9.7 follows from the last three lemmas.

Since $\mathrm{Vect}_{\mathbb{R}}\{x, y, z\} \cong \mathfrak{su}(2)$, the restriction of the map $-\frac{i}{2} Q$ to $\mathrm{Vect}_{\mathbb{R}}\{x, y, z\}$ yields a unitary
representation $\mathfrak{su}(2) \to \mathfrak{u}(n+1)$ which is actually irreducible. Hence, by considering the problem of extending
Kähler functions on $S^2$ to $P(\mathbb{C}^{n+1})$, we have been led to compute the irreducible unitary representations
of the Lie algebra $\mathfrak{su}(2)$, which is exactly what physicists use to describe the spin of a particle. This means
that we can recover all the probabilities in the Stern-Gerlach experiment and that, in fine, all information
on the spin is encoded in the binomial distribution $B(n,q)$.
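
The $\mathfrak{su}(2)$ structure can be observed directly on the matrices $Q(x)$, $Q(y)$, $Q(z)$. The sketch below (hypothetical function names) builds them from the tridiagonal formula of Lemma 9.8 and computes commutators. With these entries one finds $[Q(x), Q(y)] = -\frac{2i}{n} Q(z)$ and its cyclic permutations, which is the matrix counterpart of the bracket relations used above. The sign depends on the index conventions adopted here and should be read as an assumption of this example.

```python
from math import sqrt

def q_matrix(n, u0, u, v, w):
    """Tridiagonal Hermitian matrix Q(f) for f = u0 + u*x + v*y + w*z."""
    Q = [[0j] * (n + 1) for _ in range(n + 1)]
    for k in range(n + 1):
        Q[k][k] = complex(u0 - u * (2.0 / n) * (n / 2.0 - k))
    for l in range(n):
        Q[l][l + 1] = sqrt((n - l) * (l + 1)) * (v - 1j * w) / n
        Q[l + 1][l] = Q[l][l + 1].conjugate()
    return Q

def commutator(A, B):
    """Matrix commutator [A, B] = AB - BA for square lists of lists."""
    m = len(A)
    return [[sum(A[i][k] * B[k][j] - B[i][k] * A[k][j] for k in range(m))
             for j in range(m)] for i in range(m)]

def scaled(c, A):
    """Entrywise product c * A."""
    return [[c * entry for entry in row] for row in A]
```

For $n = 1$ the three matrices reduce, up to sign, to the Pauli matrices.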

10 Gaussians and the quantum harmonic oscillator 

Let $\mathcal{N}(\mu,1)$ be the set of all probability density functions defined over $\Omega := \mathbb{R}$ by

$p(\xi;\mu) := \frac{1}{\sqrt{2\pi}}\exp\Big\{-\frac{(\xi-\mu)^2}{2}\Big\}$, (141)

where $\xi \in \Omega$ and $\mu \in \mathbb{R}$.
Since $p(\xi;\mu) = \exp\{C(\xi) + \theta \cdot F(\xi) - \psi(\theta)\}$ with $C(\xi) = -1/2 \cdot \xi^2$, $\theta = \mu$, $F(\xi) = \xi$ and $\psi(\theta) :=
1/2 \cdot \theta^2 + \ln(\sqrt{2\pi})$, $\mathcal{N}(\mu,1)$ is an exponential family whose natural parameter is $\theta = \mu$.

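The Fisher metric is easily seen to be constant, $h_F(\theta) \equiv 1$, which implies that $T\mathcal{N}(\mu,1) \cong \mathbb{C}$ (isomorphism
of Kähler manifolds). Moreover, it is not difficult to see that the space $\mathscr{K}(\mathbb{C})$ of Kähler functions
on $\mathbb{C}$ is

$\mathscr{K}(\mathbb{C}) = \mathrm{Vect}_{\mathbb{R}}\Big\{1,\ x,\ y,\ \frac{x^2+y^2}{2}\Big\}$ (142)

(here $x$ and $y$ are respectively the real and imaginary parts of $z \in \mathbb{C}$), with the following commutators

$\{1, \cdot\} = 0, \quad \{x,y\} = 1, \quad \Big\{x, \frac{x^2+y^2}{2}\Big\} = y, \quad \Big\{y, \frac{x^2+y^2}{2}\Big\} = -x$. (143)

Clearly, $\Gamma(\mathcal{N}(\mu,1))$ is trivial. Hence, $\mathcal{N}(\mu,1)^{\mathbb{C}} = T\mathcal{N}(\mu,1)/\{e\} \cong \mathbb{C}$, i.e.,

$\mathcal{N}(\mu,1)^{\mathbb{C}} \cong \mathbb{C}$. (144)

The canonical projection $\pi_{\mathcal{N}(\mu,1)} : \mathbb{C} \to \mathcal{N}(\mu,1)$ is easily seen to be

$\pi_{\mathcal{N}(\mu,1)} : \mathbb{C} \to \mathcal{N}(\mu,1), \quad \pi_{\mathcal{N}(\mu,1)}(z)(\xi) = \frac{1}{\sqrt{2\pi}}\exp\Big\{-\frac{(x-\xi)^2}{2}\Big\}$, (145)

where $z = x + iy \in \mathbb{C}$.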

For the spectral theory, observe that $\mathcal{A}_{\mathcal{N}(\mu,1)} = \mathrm{Vect}_{\mathbb{R}}\{1, \xi\}$, where $\xi : \mathbb{R} \to \mathbb{R}$ is the identity, and that
the group of holomorphic isometries of $\mathbb{C}$ is the group $E(2) = \mathbb{R}^2 \rtimes O(2)$ of Euclidean isometries. As a simple
calculation shows, the only Kähler functions $f$ on $\mathbb{C}$ that can be written as $f(z) = \int_\Omega X(\xi)\,[(\pi_{\mathcal{N}(\mu,1)} \circ \phi)(z)](\xi)\,d\xi$,
where $X \in \mathcal{A}_{\mathcal{N}(\mu,1)}$ and $\phi \in E(2)$, are functions of the form $f(z) = u_0 + ux + vy$, where $u_0, u, v \in \mathbb{R}$. For
such a function, the subset $\mathrm{spec}(f) := \mathrm{Im}(X)$ is well defined, i.e. independent of the decomposition of $f$,
and so is its associated probability $P_{f,z}$.

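For $f(z) = u_0 + ux + vy$, calculations yield $\mathrm{spec}(f) = \mathbb{R}$ if $\mathrm{grad}(f) \neq 0$, $\mathrm{spec}(f) = \{u_0\}$ if $\mathrm{grad}(f) = 0$
and

$P_{f,z} = \begin{cases} \dfrac{1}{\sqrt{2\pi}\,\|\mathrm{grad}(f)\|}\exp\Big(-\dfrac{(\xi - f(z))^2}{2\,\|\mathrm{grad}(f)\|^2}\Big)\,d\xi & \text{if } \mathrm{grad}(f) \neq 0 \\ \delta_{u_0} & \text{if } \mathrm{grad}(f) = 0 \end{cases}$ (146)

where $\mathrm{grad}(f) = (u,v)$ denotes the Riemannian gradient of $f$ and $\|\mathrm{grad}(f)\| = \sqrt{u^2+v^2}$ its Euclidean
norm.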
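
The measure $P_{f,z}$ for $\mathrm{grad}(f) \neq 0$ can be explored numerically. The sketch below assumes that it is the Gaussian measure with mean $f(z)$ and standard deviation $\|\mathrm{grad}(f)\|$, which is what one obtains by pushing a unit-variance Gaussian forward through an affine map $X(\xi) = \alpha + \beta\xi$ with $|\beta| = \|\mathrm{grad}(f)\|$. The function names and the trapezoidal integrator are hypothetical helpers introduced only for this illustration.

```python
from math import exp, pi, sqrt

def density(xi, f_value, grad_norm):
    """Gaussian density with mean f_value and standard deviation grad_norm."""
    return (exp(-(xi - f_value) ** 2 / (2 * grad_norm ** 2))
            / (sqrt(2 * pi) * grad_norm))

def integrate(g, a, b, steps=20000):
    """Composite trapezoidal rule for g on [a, b]."""
    h = (b - a) / steps
    total = 0.5 * (g(a) + g(b))
    for i in range(1, steps):
        total += g(a + i * h)
    return total * h
```

Integrating `density` over a wide interval should give total mass 1, and integrating `xi * density` should give back $f(z)$, so that the mean outcome of the "measurement" is the value of the observable at $z$.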

Let us now relate the above formulas with the quantum harmonic oscillator. Let $\hbar$ be a nonnegative
real constant and let $\Psi : \mathbb{C} \to C^\infty(\mathbb{R}, \mathbb{C})$ be the function defined for $\xi \in \mathbb{R}$ and $z = x + iy \in \mathbb{C}$ by

*(*)«) - (^I7iexp{ - ^} exp{ 4^} . (147) 

Let us also define a linear map $Q$ from the space $\mathscr{K}(\mathbb{C})$ to the space of unbounded operators acting on
$L^2(\mathbb{R}, \mathbb{C})$ by

l^Id, x^x, y^rK-, —JL * --— + -^ - (_ + -) . (148) 

Observe that Q is "essentially" the operator which quantizes the classical harmonic oscillator. 
Proposition 10.1. For all $f \in \mathscr{K}(\mathbb{C})$ and for all $z \in \mathbb{C}$, we have :

$f(z) = \langle \Psi(z),\ Q(f) \cdot \Psi(z)\rangle$, (149)

where $\langle\ ,\ \rangle$ is the usual $L^2$-scalar product on $L^2(\mathbb{R}, \mathbb{C})$.
Proof. By direct calculations. □

Equation (149) is the exact analogue of (127), but in an infinite dimensional context. Indeed, in [Mola]
(work in progress), we regard the space $\mathcal{D} := \{p : M \to \mathbb{R} \mid p \text{ smooth},\ p > 0 \text{ and } \int_M p(x) \cdot d\mathrm{vol}_g = 1\}$
of smooth probability density functions on a (compact) oriented Riemannian manifold $(M,g)$ as an
infinite dimensional (Fréchet) manifold, and we exhibit the analogues of the Fisher metric and exponential
connection on $\mathcal{D}$, obtaining via Dombrowski's construction, an almost Hermitian structure on $T\mathcal{D}$ which
allows for an embedding $\mathbb{C} \hookrightarrow T\mathcal{D} \subseteq C^\infty(M, \mathbb{C})$ similar to that of $S^2 \hookrightarrow P(\mathbb{C}^{n+1})$ in §9. In this approach,
the map $\Psi$ is obtained by solving a simple partial differential equation related to a particular description
of the tangent bundle of $\mathcal{D}$ (see [Mola] for details).